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DESCRIPTION 

MATERIALS AND METHODS FOR DELIVERY AND EXPRESSION OF 
HETEROLOGOUS DNA IN VERTEBRATE CELLS 

5 

The subject invention was made with government support under a research project 
supported by U.S. Department of Agriculture Grant No. 97-35302-4431 and National 
Institute of Health Grant No. P50-HL594 12-01. The government has certain rights in this 
invention. 
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Cross-Reference to a Related Application 
This application is a continuation-in-part of co-pending application Serial No. 
09/086,651, filed May 29, 1998. This application also claims the benefit of provisional 
patent application Serial No. 60/224,479; filed August 10, 2000. 

15 

Background of the Invention 
Gene therapy is a powerful concept just now beginning to see applications designed 
to treat human diseases such as genetic disorders and cancer. The introduction of genes into 
an organism can be achieved in a variety of ways, including virus-based vectors. Viral gene 
20 therapy vectors can either be designed to deliver and express genes permanently (stable 

integration of a foreign gene into host chromosome) or transiently (for a finite period of 
time). 

Current virus-based gene transfer vectors are typically derived from animal viruses, 
such as retroviruses, herpesviruses, adenoviruses, or adeno-associated viruses. Generally, 
25 these viruses are engineered to remove one or more genes of the virus. These genes may be 

removed because they are involved in viral replication and/or to provide the capacity for 
insertion and packaging of foreign genes. Each of these known vectors has some unique 
advantages as well as disadvantages. One primary disadvantage is an inability to readily 
package and deliver large DNA inserts that are greater than 10 kb in size. 



G: SH-API»S.lJF-221ClXC].\vpd[)Nh la 



2 UF-221C1XC1 

To illustrate the problem of capacity of most gene therapy vectors, one need only 
consider adeno-associated virus (AAV), one of the most promising of the gene therapy 
vectors. Adeno-associated virus (AAV) is a parvovirus which consists of a 4.7 kb single 
stranded DNA genome (Nienhuis, A.W., C.E. Walsh, J.M. Liu [1993] "Viruses as 
5 therapeutic gene transfer vectors" In: N.S. Young (ed.) Viruses and Bone Marrow, Marcel 

Decker, New York, pp. 353-414). The viral genome consists of the family of rep genes 
responsible for regulatory function and DNA replication and the cap genes that encode the 
capsid proteins. The AAV coding region is flanked by 145 nucleotide inverted terminal 
repeat (ITR) sequences which are the minimum cis-acting elements essential for replication 

10 and encapsidationof the genome. In the absence of a helper virus such as adenovirus, AAV 

causes a latent infection characterized by the integration of viral DNA into the cellular 
genome. The major advantages of recombinant AAV (rAAV) vectors include a lack of 
pathogenicity in humans (Berns, K.I. and R.A. Bohenzky [1987] "Adeno-associatedviruses: 
an update" Adv. Virus Res. 32:243-306), the ability of wild-type AAV to integrate stably into 

15 the long arm of chromosome 19 (Kotin, R.M., R.M. Linden, K.L Berns [1992] 

"Characterization of a preferred site on human chromosome lOq for integration of adeno- 
associated virus DNA by nonhomologous recombination" EMBO J 11:5071-5078), the 
potential ability to infect nondividing cells (Kaplitte/ al [1994] "Long term gene expression 
and phenotypic correction using adeno-associated virus vectors in the mammalian brain" 

20 Nat. Genet. 8: 148- 154), and broad range of infectivity. However, the packaging capacity of 

AAV limits the size of the inserted heterologous DNA to about 4.7 kb. Gene therapy vector 
systems are also needed that combine a large carrying capacity with high transduction 
efficiency in vivo. 

Until recently, complex insect viruses (entomoviruses) had not been considered for 
25 use as possible viral gene therapy vectors. In the past, studies of entomoviruses have mainly 

concentrated on their use as biopesticides, expression systems or taxonomic novelties to 
compare to their mammalian virus counterparts. 

The family Poxviridae comprisestwo subfamilies, the Chordopoxviridae(vertebrate) 
and the Entomopoxviridae(insect) viruses (EPVs). EPVs were first discovered in the early 
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1960's, and have subsequently been shown to have a worldwide distribution. The subfamily 
contains three genera; A, B and C which infect beetles, moths (lepidoptera) and 
grasshoppers, and midge flies respectively (Moyer, R.W. [1994] Entomopoxviruses, p. 392- 
397, Encyclopedia of Virology . R.G. Webster and A. Granoff (eds.). Academic Press Ltd, 
5 London). It should be recognized that classification within the three EPV genera is based 

solely on morphological and host range criteria and not molecular properties. Indeed, it is 
now clear that the group B viruses of butterflies and moths (lepidoptera) and grasshoppers 
(orthoptera) are quite distinct from one another (Afonso, C.L., E.R. Tulman, Z. Lu, E. Oma, 
G.F. Kutish, and D.L. Rock [1999J "The genome of Melanoplus sanguinipes 

10 Entomopoxvirus" J. Virol. 73:533-552). AmEPV was originally isolated in India from the 

red hairy caterpillar, and it is the prototype virus of this group. This is primarily because of 
its ability to be easily grown in cultured insect cells, although certain Choristoneura and 
Heliothis EPVs have also been shown to replicate in cell cultures at low levels (Fernon, C.A., 
A.P. Vera, R. Crnov, J. Lai-Fook, R.J. Osborne, and D.J. Dall [1995] "Replication of 

15 Heliothis armigera entomopoxvirus in vitro" J. Invertehr. Pathol. 66:216-223; Lytvyn, V.. 

Y. Fortin, M. Banville, B. Arif, and C. Richardson [1992] "Comparison of the thymidine 
kinase genes from three entomopoxviruses" J. Gen. Virol. 73:3235-3240). 

EPVs are the most distant relatives of mammalian poxviruses and exhibit both 
similarities and differences to the more commonly studied chordopoxviruses, such as 

20 vaccinia virus (VV). Similarities include morphology, a large linear double stranded genome 

(previously estimated at 225 kb for AmEPV, 190 kb for VV), common transcriptional 
regulation sequence motifs, non-spliced transcripts and a cytoplasmic site of replication. 
Differences include the G+C content of the viral DNA (a low 18% for AmEPV, 37% for 
VV), optimal growth temperatures (28°C for AmEPV, 37°C for VV), and host range. 

25 AmEPV does not replicate in vertebrate cells, and VV does not replicate in insect cells, 

although both viruses enter their respective non-permissive cells and initiate a replicative 
cycle (Langridge, W.H. [1983] "Detection of Amsacta moorei entomopoxvirusand vaccinia 
virus proteins in cell cultures restrictive for poxvirus multiplication" ,/ Invertehr. Pathol 
42:77-82). 
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Generally, growth of AmEPV in insect cell cultures is similar to that of vertebrate 
poxviruses in mammalian cells. Receptors mediating poxvirus attachment and entry appear 
to be widespread and common, as EPVs infect vertebrate cells and VV infects insect cells 
(Li, Y., R.L. Hall, and R.W. Moyer [1997] "Transient, nonlethal expression of genes in 
5 vertebrate cells by recombinant entomopoxvirusesV. Virol. 71 :9557-9562; Li, Y., S. Yuan, 

and R.W. Moyer [1998] "The non-permissive infection of insect (gypsy moth) LD-652 cells 
by vaccinia virus" Virology 248:74-82). It is assumed by analogy with the vertebrate 
poxviruses that AmEPV gene expression patterns can be classified as early, intermediate and 
late, but experimental data is minimal (Winter, J., R.L. Hall, and R.W. Moyer [1995] "The 

10 effect of inhibitors on the growth of the entomopoxvirus from Amsacta moorei in Lymantria 

dispar (gypsy moth) cells' 1 Virology 211:462-473). EPVs have been shown to contain 
vertebrate poxvirus promoter elements and early transcription termination motifs (Afonso, 
C.L., E.R. Tulman, Z. Lu, E. Oma, G.F. Kutish, and D.L. Rock [1999] "The genome of 
Melanoplus scmguinipes Entomopoxvirus" J. Virol. 73:533-552; Hall, R.L. and R.W. Moyer 

15 [1991] 'identification, cloning, and sequencing of a fragment of Amsacta moorei 

entomopoxvirus DN A containing the spheroidin gene and three vaccinia virus-related open 
reading frames" J. Virol. 65:6516-6527). The most unique feature of poxvirus replication 
is development mostly, if not exclusively, within the cytoplasm. As a consequence of 
cytoplasmic development, EPV promoters (like those of vertebrate poxviruses) are 

20 recognized only by the virally encoded transcription system. The general availability of 

poxvirus specific promoters, coupled with exclusion of the nuclear transcription apparatus 
are major advantages for engineering and control of foreign gene expression related to gene 
therapy applications. 

EPVs, like VV, contain a number of genes which are nonessential for growth in cell 
25 culture. Two examples are the thymidine kinase (TK) and spheroidin genes. The spheroidin 

gene can be viewed as a counterparts the polyhedrin and A-type (ATI) occlusion genes of 
baculoviruses and cowpox viruses respectively. VV also contains an ATI gene, but it is 
defective. Spheroidin is the most abundantly expressed AmEPV gene, and serves to 
"occlude" infectious virions within an environmentally resistant occlusion body. Both the 
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AmEPV TK and spheroidin gene can readily serve as sites for insertion and expression of 
foreign genes by utilizing standard plasmid-mediated recombination. 

Entomopoxvirus (EPVs) productively infect and kill only insects (Granados, R.R. 
[1981] "Entomopoxvirus infections in insects," in Pathogenesis of Invertebrate Microbial 
5 Disease, p. 102-126, Davidson, E.W. (ed.) New Jersey, Allanheld Totowa) and can be 

isolated from Amsacta moorei (AmEPV), the red hairy caterpillar. Entomopox viruses and 
vectors have been described (See, for example, U.S. Patent Nos. 5,721,352 and 5,753,258, 
the disclosure of which is incorporated herein by reference). Like other EPVs, AmEPV 
cannot productively infect vertebrate cells. Indeed, following addition of AmEPV to 

10 vertebrate (mouse L-929) cells at multiplicities up to 10 particles/cell, no changes in cellular 

morphology (asjudged by phase contrast microscopy) are detected (Langridge, W.H. [1983] 
"Detection of Amsacta moorei entomopoxvirus and vaccinia virus proteins in cell cultures 
restrictive for poxvirus multiplication",/ Invertebr. Pathol. 42:77-82). 

AmEPV infects vertebrate cells in a non-cytocidal manner and the infection is 

15 abortive. Like all poxviruses, the virus is cytoplasmic and does not normally enter the 

nucleus. A consequence of this unusual biology, is that all poxvirus mediated gene 
expression takes place in the cytoplasm in the infected cell. AmEPV promoters and those 
of the eucaryotic cell are completely different and cellular promoters arc not recognized by 
the AmEPV transcription machinery nor are AmEPV viral promoters recognized by RNA 

20 polymerase II of the host cell. 



25 



Brief Summary of the Invention 

?J2— ■ The subject invention concerns a novel viral vector system for gene therapy basgcK 

on an insect poxvirus designed to deliver genes for integration and^statrteTpermanent 
expression in vertebrate cells. In an exemplified embojiimrTTCa recombinant AmEPV vector 
was constructed that contains heterologj£«S"genes under the control of promoters that the 
drive the expression of thejietefologous genes in vertebrate cells. The^ gene and the gene 
encoding G4Jj^sTstance were used in an exemplified construct. The recombinant AmEPV 
was usea to infect vertebrate cells and following infection the cells were transferred to media 
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containing G4 1 8. Cells expressing both GFP and G4 1 8 resistance were obtained. Thi^th^ 
vectors of the subject invention canj)ejjs£d~^^ segments for the 

ejogi«ecriTTgl5r^ 

The subject invention also concerns cells that have been infected with or transformed 
5 with a recombinant vector of the present invention. The subject invention also concerns 

methods for providing gene therapy for conditions or disorders of an animal requiring 
therapy, such as genetic deficiency disorders. 

In addition, the subject invention concerns novel AmEVP polypeptides and the 
polynucleotide sequences which encode these polypeptides. The AmEPV polynucleotide 
10 sequences of the subject invention encode a triacylglyceride lipase (SEQ ID NO: 1), a 

Cu 7Zn +t superoxide dismutase (SOD) (SEQ ID NO: 2), a CPD photolyase (SEQ ID NO: 
3), a baculovirus-like inhibitor of apoptosis(IAP) (SEQ ID NO: 4), two poly(A) polymerase 
small subunits (SEQ ID NOS: 5 and 6), two DNA polymerases (SEQ ID NOS: 7 and 8), an 
ABC transporter-likeprotein(SEQ ID NO: 9), a Kunitz-motif protease inhibitor (KPI) (SEQ 
15 ID NO: 10), and a poly(A) polymerase large subunit (SEQ ID NO: 1 1). 

In addition, the subject invention concerns isolated AmEPV polypeptides encoded 
by the polynucleotide sequences of the subject invention, including a triacylglyceride lipase 
(SEQ ID NO: 12), a Cu 7Zn f superoxide dismutase (SOD) (SEQ ID NO: 13), a CPD 
photolyase (SEQ ID NO: 14), a baculovirus-like inhibitor of apoptosis (IAP) (SEQ ID NO: 
20 15), two poly(A) polymerase small subunits (SEQ ID NOS: 16 and 17), two DNA 

polymerases (SEQ ID NOS: 18 and 19), an ABC transporter-like protein (SEQ ID NO: 20), 
a Kunitz-motif protease inhibitor (KPI) (SEQ ID NO: 21), a poly(A) polymerase large 
subunit (SEQ ID NO: 22) and other AmEPV polypeptides. 

The subject invention further pertains to other entomopoxvirus sequences. 
25 Polynucleotides of the subject invention include, for example, sequences identified in the 

attached sequence listing, as well as the tables and figures and described by open reading 
frame position within the genome. 

In addition, the subject invention includes polynucleotides which hybridize w ith other 
polynucleotides of the subject invention. 
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Polynucleotide sequences of this invention have numerous applications in techniques 
known to those skilled in the art of molecular biology having the benefit of the instant 
disclosure. These techniques include their use as insertion sites for foreign genes of interest 
hybridization probes, for chromosome and gene mapping, in PCR technologies, and in the 
5 production of sense or antisense nucleic acids. 

Brief Description of the Drawings 
The file of this patent contains at least one drawing executed in color. Copies of this 
patent with color drawings(s) will be provided by the Patent and Trademark Office upon 
10 request and payment of the necessary fee. 

Figure 1 shows a physical map of an exemplified recombinant vector of the subject 
invention (pAmEPV TKUF5) in which a portion of the plasmid pTKUF5 has been cloned 
within the AmEPV TK gene flanking regions. TR is the AAV terminal repeat; pA is a 
polyadenylationsite; SD/SA is the SV40 late splice donor, splice acceptor sequence. GFP, 
15 the green fluorescent protein gene, is under the control of a CMV promoter. Neo, the 

neomycin resistance gene, is under the control of a herpes TK gene promoter. 

Figure 2 shows an electrophoretic analysis of transformed mammalian cell lines. 
Each lane contains HindlU digested genomic DNA. Lane P contains genomic DNA from 
293 cells and pTR-UF5 plasmid, as a positive control. Lanes Al through A5 contain DNA 
20 extracted from transformed cell lines made by recombinant AmEPV (AmEPVpTKUFS) 

infection. Lanes Bl through B6 contain DNA obtained from cell lines transfected with 
plasmid pTR-UF5. 

Figure 3 shows expression of lacZ in recombinant AmEPV-infected mammalian 
cells. CV-1 cells were mock infected (A) or infected with various AmEPV lacZ 
25 recombinants, where lacZ was under the control of the cowpox virus late ATI gene promoter 

(B), the late AmEPV spheroidin promoter (C), the M. melonontha early fus promoter (D) or 
the AmEPV early esp promoter (E). Infection of human Huh-7 liver cells with the AmEPV 
TKe.v/7-lacZ recombinant is also shown as an additional control (F). The infected cell 
monolayers were stained with X-gal 24h postinfection. 
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Figure 4 shows the survival of mammalian cells following infection by recombinant 
AmEPV TKesp-gfp. Subconfluent CV-1 cells were infected with AmEPV TKesp-gfp at an 
m.o.i. of 1 PFU/cell. The individual fluorescent cells were located and followed over a period 
of two to three days and periodically photographed with a fluorescent microscope. One 
5 fluorescent cell, identified 1 8 hours post infection (A), had divided into two cells by 24 (B) 

to 26 (C) h postinfection and by 50 h had become a small cluster of dividing cells (D). 

Figure 5 shows AmEPV-mediated P-galactosidase expression in the muscle of 
mouse. 2X10 6 PFU (1 00 |il) of recombinant AmEPV -esplacZ was injected into the muscle 
of the hind leg of a mouse. As a control, mice were injected with the same amount of 

10 recombinant AmEPV -SPHlacZ or 100 of PBS. Two days later, the mice were sacrificed, 

the muscle was excised from the injected area and cut into small pieces, and fixed with 3% 
formaldehyde for 30 min. then stained with X-gal. The muscle injected with recombinant 
AmEPV pTK-esplacZ showed P-galactosidase expression. No p-galactosidase expression 
was observed in control mice. 

15 Figure 6 shows transformed 293 cells (A) derived from the colony infected with 

recombinant AmEPV-TKUF5 which are G4 1 8 resistant showing that cells are GFP positive, 
as well as non-fluorescent, non-transformed 293 cells (B). 

Figure 7 shows a linear map of the AmEPV genome. 0-1 39440(A ) and 139441- 
232392(B). Predicted ORFs are numbered consecutively from left to right based upon the 

20 position of the initiating methionine codon. ORFs transcribed in a rightward direction are 

shown above the horizontal line designating the viral genome; ORFs transcribed to the left 
are below. ITRs are indicated by heavy black arrows. A distance of lkb is as shown. ChPV 
homologsare indicated with red numbers, additional MsEPV homologs are indicated with 
purple numbers. Some ORFs have been assigned function based upon BLAST data. 

25 Figure 8 shows a comparison of the genomic organization of AmEPV, MsEPV and 

VV. AmEPV ITRs are positioned at the termini of the viral genome as indicated. AmEPV 
genes which have homology to VV genes are depicted in (A). AmEPV genes which have 
homology to MsEPV are depicted in (B). Genes in the AmEPV genome common to both 
MsEPV and VV are in (C). Unique genes encoded by AmEPV are shown in (D). 
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Figure 9 shows a comparison of the spatial distribution of homologous genes 
between AmEPV, MsEPV and VV. A random sampling of genes conserved within the 
genomes of all three indicated viruses were plotted on the 1 1 9kb genome of VV, the 232kb 
AmEPV genome, and the 236kb MsEPV genome. From left to right on the AmEPV 
5 genome, the genes shown and their BL AST-assigned function are: AMV016, thymidine 

kinase; AMV035, membrane protein; AMV038, PAP large subunit; AMV050, DNA 
polymerase; AMV051, RP035; AMV066, RP0132; AMV105, VETF-L; AMV122, 
rifampicin resistance; AMV 1 38, no BLAST-assignedfunction; AMV 1 50, ATP/GTP binding 
protein; AMV 1 66, RPO 1 9; AMV 181, core protein; AMV 1 86, no BL AST-assigned function; 

10 AMV205, VLTF-3 ; AM V22 1 , RPO 1 47; AMV232, membrane protein; AM V243, membrane 

protein; AMV249, no BLAST-assigned function. Plots compare both orientations of the 
AmEPV genome. (A) left to right, (B) right to left. 

Figure 10 shows residues shared between poxvirus poly (a) polymerase subunit 
homologs. Consensus shows the conservation between all five sequences. Insect consensus 

15 shows identity among the four EPV ORFs. AmEPV consensus displays identities between 

the two AmEPV subunits. 

Figure 11 shows the transmembrane domains possessed by the AmEPV ABC 
transporter protein. This graphic was produced by the THAMM program (Sonnhammer, E. 
L. L., Hejne, G., and Krogh, A. [1998] fci A hidden Markov model for predicting 

20 transmembrane helices inprotein sequences" Proc. of Sixth Int. Conf. on Intelligent Systems 

for Molecular Biology (J. Glasgow, T. Littlejohn, F. Major, R. Lathrop, D. Sankoff, and C. 
Sensen, Eds.), pp. 175-182. AAAI press, Menlo Park, CA.). The regions of the protein 
indicated by the thirteen bars can be seen to have a probability of 1 as transmembrane 
domains. Although not shown in this figure, the areas between these domains (residues 432- 

25 601 and 1097-1285) contain ABC transporter (ATP binding) motifs (Prosite PS0021 1). 

Figure 12 shows the amino acid sequence of the AmEPV serine protease inhibitor. 
Amino acid abbreviations are standard. The Kunitz family signature (Prostite PS00280) is 
shown underlined and italicized from residues 55 to 73. 
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° ^\ Brief Description of the Sequences 

SEQ ID NO: 1 is the nucleotide sequence of the gene encoding AmEPV 
triacylglyceride lipase. \ 

SEQ ID NO: 2 is the nucleotide sequence of the gene encoding AmEPV OT7Zn^ 
5 superoxide dismutase (SOD). 

SEQ ID NO: 3 is the nucleotide sjequence of the gene encoding AmEPV CPD 
photolyase. 

SEQ ID NO: 4 is the nucleotide sequence of the gene encoding AmEPV baculovirus- 
like inhibitor of apoptosis (IAP). j 
10 SEQ ID NO: 5 is the nucleotide/ sequence of the gene encoding a first AmEPV 

poly(A) polymerase small subunit. / 

SEQ ID NO: 6 is the nucleotide/sequence of the gene encoding a second AmEPV 
poly(A) polymerase small subunit. 

SEQ ID NO: 7 is the nucleotide sequence of the gene encoding a first AmEPV DNA 
15 polymerase. 

SEQ ID NO: 8 is the nucleotide sequence of the gene encoding a second AmEPV 
DNA polymerase. 

SEQ ID NO: 9 is the nucleotide sequence of the gene encoding AmEPV ABC 
transporter-like protein. 

20 SEQ ID NO: 10 is the micleotide sequence of the gene encoding AmEPV Kunitz- 

motif inhibitor (KPI). / 

SEQ ID NO: 11 is the Nucleotide sequence of the gene encoding AmEPV poly(A) 
polymerase large subunit. f 

SEQ ID NO: 12 is the Lm'mo acid sequence for the AmEPV triacylglyceride lipase. 
25 SEQ ID NO: 13 is the amino acid sequence for the AmEPV Cu 7Zn superoxide 

dismutase (SOD). 

SEQ ID NO: 14 is the amino acid sequence for the AmEPV CPD photolyase. 
SEQ ID NO: 15 is tie amino acid sequence for the AmEPV baculovirus-like 
inhibitor of apoptosis (IAP). 
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SEQ ID NO: 16 is the amino acid sequence for the first AmEPV poly(A) polymery 
small subunit. 

SEQ ID NO: 17 is the amino acid sequence for the second tyx&PV poly(A) 
polymerase small subunit. 
5 SEQ ID NO: 18 is the amino acid sequence for the finjj^CmEPV DNA polymerase. 

SEQ ID NO: 19 is the amino acid sequerpe Tor the second AmEPV DNA 
polymerase. 

SEQ ID NO: 20 is the amino aci^equence for the AmEPV ABC transporter- like 

protein. 

10 SEQ ID NO: 21 isj£h^ amino acid sequence for the AmEPV Kunitz-motif inhibitor 

(KPI). 

SEQ IJKNO: 22 is the amino acid sequence for the AmEPV poly(A) polymerase 
large subunit. 

>EQ ID NOS: 23-27 is the nucleotide sequence of the AmEPV genome. 
15 

Detailed Disclosure of the Invention 
The subject invention concerns three aspects of entomopoxviruses (EPVs) as novel 
recombinant vectors: (1) As a system for the expression of high levels of foreign proteins. 
(2) for the transient expression of foreign genes in mammalian cells and (3) for the stable 
20 transformation of vertebrate cells for the long term expression of foreign proteins. In 

addition, the subject invention provides the nucleotide sequence of the entire genome of 
genus B entomopoxvirus from Amsacta moorei (AmEPV). Accordingly, the subject 
invention also concerns isolated polynucleotides encoding AmEPV proteins. 

The subject invention concerns novel recombinant vectors and methods for delivery 
25 and expressionof heterologous polynucleotides^ vertebrate cells. The recombinant vectors 

of the subject invention provide for stable integration and expression of heterologous DNA 
in the host cell. Advantageously, the vectors of the invention are adapted for accepting large 
heterologous polynucleotide inserts which can be delivered in an infected or transformed cell 
and expressed in a stable fashion. The subject invention can be used to provide gene therapy 
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for conditions or disorders of vertebrate animals, such as a mammal or human, that is in need 
of such therapy. 

One aspect of the subject invention concerns a recombinant EPV vector which can 
optionally include heterologous DNA which can be expressed in a cell infected or 
5 transformed with the subject vector. Preferably, the EPV vector is derived from AmEPV. 

The recombinant EPV vectors of the present invention can optionally include inverted 
terminal repeat (ITR) sequences of a virus, such as, for example, adeno-associated virus, that 
flank the heterologous DNA insertion site on the vector. Thus, when the heterologous DNA 
is cloned into the recombinant EPV vector, the heterologous DNA is flanked upstream and 

10 downstream by the ITR sequences. 

In an exemplified embodiment, the subject vectors comprise heterologous DNA 
inserted within the vector. The heterologous DNA contained within the recombinant vectors 
of the invention can include polynucleotide sequences which encode a biologically 
functional protein. Preferably, the polynucleotides encode proteins which can provide 

1 5 therapeutic replacement or supplement in animals afflicted with disorders which result in the 

animal expressing abnormal or deficient levels of the protein that are required for normal 
biological function. Proteins encoded by the heterologous DNA can include, but are not 
limited to interleukins, cytokines, growth factors, interferons, enzymes, and structural 
proteins. Proteins encoded by the heterologous DNA can also include proteins that provide 

20 a selectable marker for expression, such as antibiotic resistance in eukaryotes. 

In a preferred embodiment, heterologous DNA within the subject vectors is operably 
linked with and under the control of regulatory sequences, such as promoters. The 
recombinant vectors of the invention preferably comprises a constitutive or regulatable 
promoter capable of promoting sufficient levels of expression of the heterologous DNA 

25 contained in the viral vector in a vertebrate cell. Promoters useful with the subject vectors 

include, for example, the cytomegalovirus (CMV) promoters and the herpes TK gene 
promoter. The vectors can also include other regulatory elements such as introns inserted 
into the polynucleotide sequence of the vector. 
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The strategy for generation of recombinant viruses is identical to that used for VV 
virus and takes advantage of the high levels of recombination with transfected plasmids 
mediated by these viruses. The basic procedure utilizes transfection of AmEPV-infected 
cells with an appropriately designed shuttle vector. Insertion of foreign genes occurs within 
5 a non-essential gene (e.g. , spheroidin or TK). Because of the cytoplasmic nature of AmEPV, 

it is necessary to place all foreign genes under control of an AmEPV (early or late) poxvirus 
promoter. Recombinants are selected and subjected to three rounds of plaque purification 
before use. 

The subject invention also concerns cells containing recombinant vectors of the 

10 present invention. The cells can be, for example, vertebrate cells such as mammalian cells. 

Preferably, the cells are human cells. Cell lines infected or transformed with the recombinant 
vectors of the present invention are also within the scope of the invention. 

The recombinant vectors of the present invention can be introduced into suitable cells 
or cell lines by methods known in the art. If the recombinant vectors are packaged in viral 

15 particles then cells or cell lines can be infected with the virus containing the recombinant 

vector. Methods contemplated for introducing recombinant vector into cells or cell lines also 
include transfection, transduction and injection. For example, vectors can be introduced into 
cells using liposomes containing the subject recombinant vectors. Recombinant viral 
particles and vectors of the present invention can be introduced into cells by in vitro or in 

20 vivo means. 

Infection of vertebrate cells is non-permissive, in that early but not late AmEPV gene 
expression occurs (Li, Y., R.L. Hall, R.W. Moyer [1997] 'Transient, nonlethal expression 
of genes in vertebrate cells by recombinant entomopoxviruses" J. Virol. 71:9557-9562). 
Specifically, if a reporter gene, such as IacZ y is driven by a late poxvirus promoter, such as 

25 either the AmEPV spheroidin or cowpox virus ATI (A-type Inclusion) promoter, no 

expression of galactosidaseis observed. If however, the lacZ is driven instead by either of 
two early EPV promoters (the Melolonthamelohmtha EPV fusolingene promoter (Gauthier, 
L., F. Coussrans, J.C. Veyrunes, M. Bergoin [1995] 'The Melolontha melolontha 
entomopoxvirus(MmEPV) fusolin is related to the fusolins of lepidoptera EPVs and to the 
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37 K baculovirus glycoprotein" Virology 208:427-436) or the 42 kDa early AmEPV protein 
(Li ef ai [1997] supra), high levels of galactosidase in the recombinant AmEPV infected 
vertebrate cells are observed. These results provide clear evidence of AmEPV entry into 
vertebrate cells followed by early, but not late, viral gene expression. 
5 It has also been found that vertebrate cells survive infection by AmEPV. If CV-1 

cells are infected with an AmEPV recombinant which contains the green fluorescent protein 
(GFP) gene regulated by the 42 kDa AmEPV early promoter (also called the esp promoter), 
single, fluorescent cells are initially observed which then proceed to grow and divide, 
ultimately forming small clusters of fluorescent cells. Therefore, AmEPV enters vertebrate 

10 cells, to produce a non-permissive, abortive infection, early viral genes are expressed and 

infected cells appear to survive and continue to divide. These properties plus a very large 
capacity of the virus for foreign genes make AmEPV an excellent vector for delivery of 
genes for expression in a transient fashion. 

However, in addition to the ability to express foreign genes in a transient fashion, it 

15 has been found that AmEPV vectors of the subject invention have the ability to stably 

transform cells and express genes in a long term fashion as well. The data presented within 
the Examples (e.g.. Example 2) and accompanying Figures (e.g.. Figure 2) confirm that the 
AmEPV vectors of the subject invention can be used to deliver DNA which subsequently 
integrates into DNA of the mammalian cell nucleus. The ability of AmEPV to deliver DNA 

20 to mammalian cells creates endless opportunity for use of the vector in the stable 

transformation and engineering of vertebrate cells. 

The Examples describe methodology for growth, titration and preparation of 
recombinant AmEPV, as well as transient expression of AmEPV in vertebrate cells, the use 
of AmEPV to stably transform mammalian cells, and potential uses of AmEPV vectors. 

25 In addition to entomopoxviruses(EPVs) as novel recombinant vectors, and methods 

of their use, the subject invention provides the nucleotide sequence of the entire genome of 
the genus B entomopoxvirus from Amsacta moorei (AmEPV). This enhances the value of 
AmEPV as a vector, and particularly as a gene therapy vector, in a number of ways. For 
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example, specific knowledge of the AmEPV genome facilitates the identification of 
additional sites which may be used as insertion sites for foreign genes of interest. 

In addition, the subject invention concerns novel AmEVP polypeptides and the 
polynucleotide sequences which encode these polypeptides. The AmEPV polynucleotide 
5 sequences of the subject invention include polynucleotides encoding a triacylglyceride 

lipase (SEQ ID NO: 1), a Cu 7Zn +t superoxide dismutase (SOD) (SEQ ID NO: 2), a CPD 
photolyase (SEQ ID NO: 3), a baculovirus-like inhibitor of apoptosis (IAP) (SEQ ID NO: 
4), two poly(A) polymerase small subunits (SEQ ID NOS: 5 and 6), two DNA polymerases 
(SEQ ID NOS: 7 and 8), an ABC transporter-like protein (SEQ ID NO: 9), a Kunitz-motif 

10 protease inhibitor (KPI) (SEQ ID NO: 1 0), and a poly(A) polymerase large subunit (SEQ ID 

NO: 1 1 ) and other polynucleotides. 

In addition, the subject invention concerns isolated AmEPV polypeptides encoded 
by the polynucleotide sequences of the subject invention, including a triacylglyceride lipase 
(SEQ ID NO: 12), a Cu 7Zn superoxide dismutase (SOD) (SEQ ID NO: 13), a CPD 

15 photolyase (SEQ ID NO: 14), a baculovirus-like inhibitor of apoptosis (IAP) (SEQ ID NO: 

15), two poly(A) polymerase small subunits (SEQ ID NOS: 16 and 17), two DNA 
polymerases (SEQ ID NOS: 18 and 19), an ABC transporter-like protein (SEQ ID NO: 20), 
a Kunitz-motif protease inhibitor (KPI) (SEQ ID NO: 21 ), and a poly (A) polymerase large 
subunit (SEQ ID NO: 22) and other AmEPV polypeptides. 

20 The subject invention includes other AmEPV sequences, as described in Table 1 , for 

example. In addition, the subject invention includes polynucleotides which hybridize with 
other polynucleotides of the subject invention. 

The genome of the genus B entomopoxvirusfrom Amsacta moorei (AmEPV) (SEQ 
ID NOS: 23-27) was sequenced and found to contain 232,392 bases with 279 unique open 

25 reading frames (ORFs) of greater than 60 amino acids. The central core of the viral 

chromosome is flanked by 9.4kbp inverted terminal repeats (ITRs), each of which contain 
1 3 ORFs, raising the total number of ORFs within the viral chromosome to 292. Default E 
(EXPECT) values of <0.0 1 were used to define homology to sequences in current databases. 
ORFs lacking homology to other poxvirus genes were shown to comprise 33.6% of the viral 
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genome. Approximately 28.6% of the AmEPV genome (52 AmEPV ORFs) encodes 
homologues of the mammalian poxvirus co-linear core genes, which are found dispersed 
throughout the AmEPV chromosome. There is also no significant gene order conservation 
between AmEPV and the orthopteran genus B poxvirus of Melanoplus sanguinipes 
5 (MsEPV). Novel AmEPV genes include those encoding an ABC transporter and a Kunitz 

motif protease inhibitor. The most unusual feature of the AmEPV genome relates to the viral 
encoded poly(A) polymerase. In all other poxviruses this heterodimeric enzyme consists of 
a single large and small subunit. However, AmEPV appears to encode one large and two 
distinct small poly (A ) polymerase subunits. AmEPV is one of the few entomopoxviruses 

10 which can be grown and manipulated in cell culture. 

It is commonly observed in poxvirus genomes that the ORFs situated near the ends 
of the genome are preferentially transcribed towards the closest termini (Upton, C, Macen, 
J. L., Maranchuk, R. A., Delange, A. M., and McFadden, G. [1988] 'Tumorigenic 
poxviruses: fine analysis of the recombination junctions in malignant rabbit fibroma virus, 

15 a recombinant between Shope fibroma virus and myxoma virus" Virology 166, 229-239; 

Upton, C. and McFadden, G. [1986] "DNA sequence homology between the terminal 
inverted repeats of Shope fibroma virus and an endogenous cellular plasmid species" Mol 
Cell Biol 6 , 265-276). However, as can been seen in Figure 8, this is not the case with the 
ORFs of AmEPV, where no discernable pattern of transcription of genes near the termini can 

20 be observed. 

Previous estimates placed the A+T content of the AmEPV genome at 81.5% 
(Langridge, W. H. R. [1983] "Partial Characterization of DNA from Five 
Entomopoxviruses' V. lnvertehr.Palh 42, 369-375). A recalculation based upon the known 
sequence has raised this to 82.2%, making AmEPV the most A+T rich poxvirus sequenced 

25 to date. In this regard, it is interesting to note that 62% of all encoded amino acids are either 

He, Leu, Phe, Asn or Lys, in descending order of frequency, which are comprised mainly of 
A+T rich codons. The unusually high A+T content may also be reflected in the large number 
of translational stop codons (8.9% of coding capacity) and relatively few methionine 
encoding triplets (1 .6%). 
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Table 1 lists all the ORFs encoded by the AmEPV genome, and functions assigned 
to the encoded proteins. Default E (EXPECT) values of <0.0 1 were used to define homology 
to sequences in current databases. 52 AmEPV ORFs (28.6% of the genome) show homology 
to ChPV genes, and 91 ORFs (3 1 .3% of the genome) have homologs in EPVs or other insect 
5 viruses. The terminal regions of AmEPV contain few genes homologous to any other gene. 

Figure 8 illustrates this phenomenon, as well as the observation that AmEPV homologs of 
both vaccinia and MsEPV genes (which we have used as available examples for the ChPV 
and EPV) are positioned more towards the centre of the AmEPV genome. In contrast, novel 
AmEPVgenes are easily identified as occurring more often towards the genomic termini. 

10 



Table 1 . Predicted ORFs of the AmEPV genome. 
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position 
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Highest blast hit b 


Expect(E) 


aa 


Domains' 1 


U 


E C* 


Promoter' 


AMV115 


97853-98731 


293 


AF063866 


MSV041 PAP-S (J3R) 


1.00E-27 


295 


PARP reg 




X 


X 


L? 


AMV1 16 


991 26-98734 


131 












x 






L? 


AMV117 


99484-99131 


1 18 












x 






L? 


AMV11R 

r^ivi v i lu 




386 


AF0638fifi 


MSV090 put membrane protein 
(A16L) 


1 00E-121 


380 


TM 




X 


X 


L 


AMV119 


102089-101016 


358 


AF063866 


MSV081 PP2C 


4.00E-69 


357 


PP2C 




X 




L 


AMV120 


102151-102570 


140 


AF063866 


MSV082 hp 


2.00E-08 


139 






X 




L 


AMV121 


103396-102581 


272 


AF063866 


MSV064 hp 


2.00E-31 


280 






X 




E? 


AMV122 


105388-103688 


567 


U44841 


HaEPV rifampicin resistance gene 
(D13L) [69] 


* 


584 






X 


X 


E? 


AMV1 23 


I UJ3U I - I UJt f U 


144 










TM 










MIVI V 1 


1 fl7R9R 1 n^Q4ft 


627 










TM 








L 


AMV125 


107560-107739 


60 










TM 


X 






L? 


AMV126 


108199-107915 


95 












X 






E 


AMV127 


109346-108762 


195 


AF063866 


MSV060 (H2R) 


1.00E-57 


194 


TM 




X 


X 


L 


AMV128 


110119-109364 


252 










TM 


X 






E 


AMV129 


110338-110156 


61 












X 






E?,L 9 


AMV130 


110459-114610 


1384 


Z82272 


C elegans- similar to ABC 
transporters 


5e-54** 


1431 


TM 


X 






E 


MMV I o I 


i to/ i i - 1 i^y*t i 


9^7 
ZOI 




















AMV132 


116352-115732 


207 


AF017791 


HaEPV 17K ORF 


0.001 


148 






X 




E 


AMV133 


117243-116383 


287 


AF063866 


MSV048 lipase 


1.00E-56 


288 


lipase 3, 
TM 




X 


X 


L 


MMV I 04 


i i oooy- l i / ^oo 


ooo 


rtruooooo 


Movz'tu Leu ncn repeat tMmtir v vjo / 


O.UUE- I 0 


^97 
OZ i 






X 






MMV 1 Jj 


i ._ i od j- i I oy^to 


OfZ. 


rtrUOOOOO 


ftAQ\/nfi7 ni it mf?WA ranninn li>rn£i 

rviovuD/ put riirMNM Ldpptny idryt? 

ciiKiinit /Hi 

suuunu 1 1\) 




860 






X 


X 




MM V 1 Ju 


1 ^UtiJO-1 ZUvZo 


97 










1 IVI 








t 


MMV 1 of 


1 1 O / O-l i-ZZZZ 


91 K 
ZiO 


MrUDOooD 


Movuoo np 


2 00E-1 5 


160 


TM 




X 




1 

L 


MMV I 00 




^9fl 
oZ\) 


MrUOOOOD 


MQ\/1 m /A 1 1 Q\ 

Mo v i o i ^m i i r\j 




31 3 






X 


X 


1 

L 


MMV I oy 




1 1 

i i ^y 


nrUDOODD 


IVlOV I OZ r *fd ^M I ULJ 


1 C-OO'DC" 

29 


1 306 






X 


X 


L 


AMV140 


127596-126667 


310 


AF063866 


MSV170 hp 


3 00E-07 


324 






X 




E 


AMV141 


127730-129085 


452 


AF063866 


MSV050 hp 


5 00E-57 


379 


TM 




X 




L? 


AMV142 


128757-128554 


68 










TM 


X 








AMV143 


129503-129061 


141 










TM 


X 






L? 


AMV144 


129837-129493 


115 










TM 


X 






E 


AMV145 


130422-129880 


181 


AF063866 


MSV167 hp 


2.00E-15 


178 






X 




E,L 


AMV146 


129909-130115 


69 










SP 


x 






L? 


AMV147 


1 ^94ftfi 

I JUHO J" I O^tOD 


668 






1 00F-146 

1 UUI — 1 *T\J 


648 


TM 




X 


X 


L 


Mrvl v l *+o 


i 1 oz*toy 


1 56 










TM 










AMA/1 4Q 


I JJ^Os I oouuo 


144 










TM 








E 


AMV1 


1 ^4°^Q-1 T^S9f) 


240 


/A 1 UUOOUU 


mqv/171 ATP/flTP hinriinn nrntpin 
iviov if i n i r / vj} i r uii fun ly ui uic^in 

(A23L) 


1 00E-43 


244 






X 


X 


L? 


AMV1 SI 




21 7 


AF0fi38fifi 


IVIO v 1 f t ■If-' 


0.77 


184 






X 




L 


AMV1 52 


1 34^S4-1 34778 


75 










TM 








E 


AMV1 


1 ^4Qft7-1?R^Qn 


468 


AFnfi3ftfifi 


M^\/1 7^ ^pr/Thr nrntpin kina^p 

(F10L) 


4 0OE-73 


457 


TM 




X 


X 


E,L 


AMV154 


135283-135086 


66 












X 








AMV155 


136164-135970 


65 










TM 


X 






L? 


AMV156 


140090-136377 


1238 


AF063866 


MSV156 hp 


3 00E-28 


1127 






X 




E? L? 


AMV157 


140145-140876 


244 


AF063866 


MSV169 hp 


9.00E-12 


230 


TM 




X 




L? 


AMV158 


140599-140254 


116 










TM 


X 






L? 


AMV159 


141543-140890 


218 


AF063866 


MSV111 hp 


0 001 


201 


TM 




X 




L? 


AMV160 


142175-141549 


209 


AF063866 


MSV110 hp 


0.16 


181 


TM 




X 




E 


AMV161 


142449-142207 


81 


AF063866 


MSV108 hp 


1 00E-11 


76 


TM 




X 




L 


AMV162 


142949-142461 


163 


AF063866 


MSV106 (A22R) 


3 00E-26 


163 






X 


X 


L? 



(i: SH-APPS' UF-22 1 1 1 XC I . wpd DNB la 



21 UF-221C1XC1 





ORF 


position 


aa a 




Highest blast hit b 


Expect(E) 


aa 


Domains" 


U 


E 




Promc 




AMV163 


143230-142955 


92 


AF063866 


MSV112hp 


0042 


130 


TM 




X 




E 




AMV164 


143963-143256 


236 


AF063866 


MSV107 hp 


1 O0E-26 


226 


TM 




X 




L 




AMV165 


145086-144112 


325 










TM 


X 






E 




AMV166 


145849-145139 


237 


AF063866 


MSV100 RP019 (A5R) 


1 O0E-35 


230 






X 


X 


E,L 


5 


AMV167 


146277-146035 


81 


U 16956 


F. neoformas polyubiquitin [144] 


1 O0E-34 


381 


ubiquitin 




X 




E 




AMV168 


146669-146316 


118 


AF063866 


MSV165hp 


1 00E-04 


126 


TM 




X 




E?L 




AMV169 


146862-147086 


75 










TM 


X 






L 




AMV170 


147105-148697 


531 


AF063866 


MSV145 hp 


300E-83 


525 






X 




L 




AMV171 


148735-149010 


92 


AF063866 


MSV166 hp 


4 00E-24 


96 


TM 




X 




L 


10 


AMV172 


149358-149017 


114 


AF063866 


MSV098 hp 


6 00E-04 


108 






X 




L 




AMV173 


149405-150724 


440 


AF063866 


MSV157 hp 


9 00E-20 


430 






X 




E 




AMV174 


152725-150716 


670 


AF063866 


MSV113 VETF-S(D6R) 




674 


SNF2N/ 




X 


X 


E 


















helicase C, TM 








AMV175 


153799-152762 


346 


AF022176 


HaEPVorf6 


2.00E-75 


286 






X 




E 


15 


AMV176 


152802-153059 


86 


AF063866 


MSV194 ALI motif 


3 00E-14 


409 


TM SP 












AMV177 


154912-153833 


360 


AF022176 
AF063866 


HaEPV orf6 
MSV194 ALI motif 


6 OOE-73 
6.00E-11 


286 
409 










E 




AMV178 


153873-154130 


86 










TM SP 










20 


AMV1 7Q 
MiVl V \ OU 


1 *idQQfi-1 'ifi'M'} 

I vj'-f O JVJ 1 JU*,H J 

I DOi.y J-1 DO / 0** 


416 

1 RA 
I Oh 


AFOfi^flfifi 


MSV1 1 S ^HSR^ 

ivio v i i \j \ vl? *j r\ i 


J. UUL t. J 


505 




X 






F_ 




AMV181 


158275-156884 


464 


Af063866 


MSV1 89 core protein [G1 L] (I7L) 


1.00E-112 


443 








X 


L? 




AMV182 


157358-157552 


65 












X 






L? 




MIVI V I Oo 


l^flinn 1 f^QQRA 

i oOi_yu- 1 ooyo^ 




Mr UDoODD 


MQ\/1Qn /AmPP\/ npi 


^ nnp na 
o.uut-uy 




SP 








i 

L 


25 


AMV1 84 


158990-158745 


82 










TM 


X 






L? 




AM VI oo 


i oy*:y i -i oyuoo 


78 




/A mCD\/ Oil \ 

(AITItrV bJL] 




7R 




X 






t 




AMV1 86 


159318-159800 


161 


AF063866 


MSV132 (A28L) 


2.00E-45 


142 


TM 






X 


L? 




AMV1 R7 




1 003 


I J 1 QO^Q 

1 C7 ^_ O 






999 


TM, Leu 
zipper 




X 




I_ 




AMV/1 RR 


1fi1 ^69-1 fi1 ^R^ 

[U I sJO^- I VJ I JO J 


60 










TM 


X 








■J \J 


AMV1 RQ 

mivi v i oy 

am\/i on 
mivi v i yu 

AMV/1 Q1 

MIVI V ID I 


i d^o f o- 1 0^.0 yo 

1 / Df - 1 Di-JOJ 
1 fi9fi91 1 fi^SdR 


fin 

D I 

76 










TM 


X 
X 
X 






L r 
E 7 




mivi v i y^: 


iDDUjy- lojuyo 


D10 


MrUZ f DO / 


pfpp\/ mpm i /mil \ r^Ti 

Liltrv IXrn-l I IL^ lOJj 




647 


SNF2N/ 
helicase C 




X 




L 


35 


AM\/1 

mivi v i yo 


1 Ifi^Hfi^ 
IDOO If - IOuUDO 


1 50 


uojyo i 


n. SdfJIcMb dpopiUbia-dbSUOIdlcu 

T~yr r~\\ Ci\ r~i 
JJI UlCll 1 


a nnF da 

H . UUC-Ut 


674 




X 










AMV/1 QA 


IfiRfififi 1fi7nft1 

1 OJOOD- 1 \3 I UO 1 


472 




M9V/1QR MTY^ motif nana familv 
iviov i jo ivi i o imuui y ci ic laiiiny 

protein 


7 00E-46 


399 






X 




£ 




AMV195 


169144-167255 


630 


AE001415 


Pf hp 


1 00E-04 


1351 


TM 


X 






L 


40 


AMV196 


168955-169134 


60 










TM 


X 






L? 


AMV197 


169246-170142 


299 


AF1 70726 


Myxoma virus m142R (B1R) [154] 


1 00E-27 


306 


2 pkinase 




X 


X 


E 




AMV198 


171035-170724 


104 


AF019224 


HaEPV ORF F2L [161] 


1 00E-21 


101 


TM 




X 




L 




AMV199 


171052-172647 


532 


AF063866 


MSV162 NAD+dep DNA ligase 


1 00E-100 


522 


DNA ligase N 


X 




L? 




AMV^nn 

/AIVI V lUU 


1797QR-17^dR1 

\ l £.1 aO" 1 l OHO 1 


228 




IVIO V 1 J3 1 IfJ 


4 OOE-18 


225 






X 




I_ 




AMV201 


173525-173881 


119 










TM 


X 






E 


AMV202 


173835-173617 


73 










TM 


X 






L? 




AMV203 


174115-173888 


76 


AF063866 


MSV168 hp 


5 00E-06 


72 






X 




L 




AMV204 


174147-174395 


83 










TM 


X 






L 




AMV205 


175077-174394 


228 


AF063866 


MSV065 VLTF-3 (A2L) 


1 00E-47 


218 






X 


X 


L 


50 


AMV206 


175140-175601 


154 












X 






E 


AMV207 
AMV208 


177028-175601 
176467-176670 


476 
68 


AF063866 


MSV198 MTG motif gene family 
protein 


6 00E-87 


399 


TM 
TM 


X 


X 




L? 




AMV209 


178433-177045 


463 


AF063866 


MSV198 MTG motif gene family 
protein 


5 00E-97 


399 


TM 




X 




E? 
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position 


aa J 




Highest blast hit b 


Expect(E) 


aa 


Domains' 1 


U 


E 




Promoter' 


AMV210 


180326-178491 


612 


AF063866 


MSV117 DNA pol beta/AP 


1 00E-122 


603 


AP endo 21 




X 




L? 










polymerase 






























DNA pol X 










AMV211 


180741-180322 


140 


AF063866 


MSV137hp 


3 00E-19 


149 






X 




L 


AMV212 


181674-180823 


284 


M24328 


Pf Asp-rich protein [261] 


1 00E-05 


537 






X 




E 


AMV213 


181926-181720 


69 












X 






E 


AMV214 


183172-181961 


404 


AF063866 


MSV184hp 


8 00E-54 


415 


TM 




X 




E 


AMV215 


182265-182546 


94 










TM 


X 






L? 


AMV216 


184838-183216 


541 


AF063866 


MSV099 hp 


1 00E-41 


519 


TM 




X 




E 


AMV217 


184913-185650 


246 


AF063866 


MSV183 myristylated membrane 


5 00E-72 


242 


TM 




X 


X 


L? 










protein (L1R) 
















AMV218 


185690-185968 


93 


L27838 


P yoelii rhoptry protein 


0.006 


2269 




X 






E 


AMV219 


186862-185966 


299 


AF063866 


MSV072 hp 


1 00E-45 


298 






X 




L 


AMV220 


187176-186958 


73 










TM 


X 








AMV221 


187007-190909 


1301 


AF063866 


MSV043 RP0147 (J6R) 


- 


1319 


RNA pol A 




X 


X 


E 


AMV222 


190095-189904 


64 










TM 


X 






L? 


AMV223 


190945-191358 


138 










TM 


X 






E 


AMV224 


1Q?1 SR-IQISRQ 


1 90 


AF001 145 


B burgdorferi pred coding region 


0 001 


343 










L? 










RRfT^QR 
















MM Vi.iiO 


i yzoyy- 1 yozoo 


1 85 


















I_? 


MIV1 VttD 




1 64 


rtrUOOOOO 


IVIO v uo i mu 


7 00e-1 5** 


141 










L 


AMV227 


i ajta / - i yo / i ** 


86 










TM 








L? 


rt(V( V j_^0 


i 1 o- 1 yo / oy 


1 60 


AFnfi^Rfifi 
rtrUDOODD 


M<^\/nQ7 nut Ca9+ RP 


O UvL I *J 


1 40 


o FFhanri 








E ? 




i y*t*oj- 1 y*Jz«;y 


75 


















E 7 


M IV 1 V J U 


i y*to*t**- 1 youou 


17Q 

i f y 


AFnfi^flfifi 
MrUOjuDD 


rVIOVZ'fJ nru IO \Ufr\) 


'i nnF.?n 


1 86 










E L 


AMV231 


i 33i.oj- l yoyo^t 


244 


rtrUDOODD 


M^V^nfl uracil nWA nlvcnQvlaQP 


O . UVJ L JO 


232 


2 UNG 








E 










(D4R) 
















AMV232 


196415-195996 


140 


AF063866 


MSV142 put. membrane protein (J5L) 


3.00E-36 


139 


TM 




X 


X 


L 


AMV233 


196480-197103 


208 


AF063866 


MSV032 hp 


4 00E-20 


252 






X 




L 


AMV234 


197134-197844 


237 


AF063866 


MSV135 put. protein phosp hatase 2C 


3 00E-27 


239 


PP2C 




X 




E?,L 


AMV235 


197847-198521 


225 


AF063866 


MSV123 hp 


1 0OE-24 


230 






X 




L? 


AMV236 


199146-198517 


210 












X 






E 


AMV237 


199209-199445 


79 












X 






L 


AMV238 


199452-200795 


448 


AF063866 


MSV055 hp 


7.00E-09 


466 


TM 




X 




E,L 


AMV239 


200248-199973 


92 










TM 


< 






L? 


AMV240 


201591-200794 


266 


U42580 


Paramecium bursaria chlorella virus 


8 00E-07 


312 




X 






E? L 










A467L 
















AMV241 


201853-201638 


72 












X 






L 


AMV242 


201954-202283 


110 












X 






E 


AMV243 


203059-202316 


248 


AF063866 


MSV094 put. membrane protein (F9L ) 


7.00E-35 


241 


TM 




X 


X 


L? 


AMV244 


202716-203075 


120 










TM 


X 








AMV245 


203101-203577 


159 












X 






E 


AMV246 


204042-203572 


157 


L33180 


AcNPV phosp hotyrosine phosphatas e 


5.00E-28 


168 


DSPc 


X 






E 


AMV247 


204194-204610 


139 


AF063866 


MSV139 hp 


1 00E-07 


139 






X 




L 


AMV248 


204830-205696 


289 


AF063866 


MSV206 put. glycosyltransferase 


1.00E-42 


287 


TM 




X 




L? 


AMV249 


205711-206046 


112 


AF063866 


MSV209 (A21L) 


2 00E-24 


113 


TM 




X 


X 


L 


AMV250 


206114-206419 


102 












X 






E 


AMV251 


206367-206155 


71 










SP 


X 






E 


AMV252 


206716-206474 


81 










TM 


X 






L? 


AMV253 


206768-208222 


485 


X95275 


Pf frameshift 


6 OOE-04 


960 




X 






E 


AMV254 


208261-208905 


215 


AF063866 


MSV027 Trp gene family protein 


1.00E-19 


297 






X 




E 


AMV255 


208973-209428 


152 


P24705 


AcNPV superoxide dismutase 


1.00E-47 


151 


SODCu 


X 




X 


L? 


AMV256 


211257-209431 


609 


AF063866 


MSV056 metalloprotease (G1L) 


4.00E-05 


629 


TM 




X 


X 


L 


AMV257 


211349-21 1723 


125 


AF003534 


Chilo iridescent virus 011 L [1 96] 


2.00E-14 


230 






X 




L? 



Ci: SIl-APPS UF-22l(/lXCI.wpd[>NB,la 



23 UF-221C1XC1 



10 



ORF 


position 


aa* 




Highest blast hit" 


Expect(E) 


aa 


Domains" U E 


C* Promoter' 


AMV258 


211785-214262 


826 


X95275 


Pf frameshift 


9.00E-04 


960 


TM x 


E? 


AMV259 


214913-214488 


142 










X 


E 


AMV260 


216480-214969 


504 


X95275 


Pf frameshift 


3 00E-12 


960 


X 


E 


AMV261 


216586-217788 


401 










TM x 


E 


AMV262 


218411-217797 


205 










X 




AMV263 


219301-218438 


288 


AF067136 


PP1 reg subunit 7 hSDS22 homolog 
[261] 


8.00E-19 


360 


multiple LRR x 


E? 


AMV264 


220213-219377 


279 


AF063866 


MSV099 hp 


7.00E-04 


519 


TM x 


E 


AMV265 


221229-220318 


304 










TM x 


E 


AMV266 


221858-221307 


184 










X 


E 



' amino acids. h GenBank accession numbers. " likelihood of identity (liXPHCT) score. J predicted domains revealed by Ptam and Psort 
programs (see materials and methods). e U = genes not found in other poxviruses; li = genes found in other entomopowi ruses: C = genes 
found in chordopow iruses. ' promoter type E = early; I, ■= late, promoters with ambiguous motifs designated by ?. hp = hypothetical 
protein. * designates an I- score too low to be quantified. Vaccinia honiologs are shown in parentheses. MsEPV homologs not already 
15 listed are in brackets, braces contain formerly named AmEPV genes. 

Vertebrate poxviruses have been shown to generally share a co-linear arrangement 
of core genes (Goebel, S. J., Johnson, G. P., Perkus, M. E., Davis, S. W., Winslow, J. P., and 
Paoletti, E. [1990] 'The complete DNA sequence of vaccinia virus" Virology 179, 247-66, 

20 517-63; Massung, R. F„ Liu, L. I., Qi, J., Knight, J. C, Yuran, T. E„ Kerlavage, A. R., 

Parsons, J. M., Venter, J. C, and Esposito, J. J. [1994] "Analysis of the complete genome 
of smallpox variola major virus strain Bangladesh- 1 975" Virology 201:21 5-240; Senkevich, 
T. G., Koonin, E. V., Bugert, J. J., Darai, G., and Moss, B. [1997] "The genome of 
molluscum contagiosum virus: Analysis and comparison with other poxviruses" Virology 

25 233:19-42; Afonso, C. L., Tulman, E. R., Lu, Z., Zsak, L., Kutish, G. F.. and Rock, D. L. 

[2000] w The Genome of Fowlpox Virus" J. Virol 74:3815-3831). Sequence information 
from a number of EPVs suggested that this co-linear arrangement of core genes is not 
conserved in members of the EPV subfamily (Hall, R. L. and Moyer, R. W. [1993] 
"Identification of an Amsacta spheroidin-like protein within the occlusion bodies of 

30 Choristoneura entomopoxviruses" Virology 1 92: 1 79- 1 87; Sriskantha, A., Osborne, R. J., and 

Dall, D. J. [1997] "Mapping of the Heliothis armigera entomopoxvirus (HaEPV) genome, 
and analysis of genes encoding the HaEPV spheroidin and nucleoside triphosphate 
phosphohydrolasel proteins"./ Gen Virol 78:31 15-3123; Afonso, C. L., Tulman, E. R., Lu, 
Z., Oma, E., Kutish, G. F., and Rock, D. L. [1999] "The genome of Melanoplussanguinipes 

35 Entomopoxvirus" ./. Virol. 73, 533-552). The complete genomic sequence of AmEPV 

enables us to unequivocally confirm this, but also shows there is no conserved co-linear core 
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between viruses of genus B. Figure 9 graphically illustrates the absence of any type of 
shared spatial gene arrangement between a typical ChPV (VV), MsEPV and the genome of 
AmEPV. Note that flipping the AmEPV genome direction from 3" to 5* does not lessen the 
degree of gene shuffling which has occurred within these different viruses. 

5 

Promoter consensus sequences 

AmEPV contains promoter elements which govern gene expression. 133 AmEPV 
genes are considered to be early, or potentially early. 1 58 genes possess motifs which result 
in late, or potentially late promoters. Only 15 genes from the entire 279 gene genome have 

10 no recognizable promoter or regulatory elements. Genes that contain the sequences 

TGAAAXXXXA or TGAATXXXXA within 100 bases of their translational start codons 
were considered early (E) or potentially early (E?), respectively (Table 1). This motif 
resembles the ChPV early promoter core consensus sequence (Moss, B. [1996] Poxviridae: 
The viruses and their replication. In "Fields Virology" (B. N. Fields, D. M. Knipe, and P. M. 

15 Howley, Eds.), Vol. 2, pp. 2637-2672, Lippincott-Reven, Philadelphia; Senkevich, T. G., 

Koonin, E. V., Bugert, J. J., Darai, G., and Moss, B. [1997] 'The genome of molluscum 
contagiosum virus: Analysis and comparison with other poxviruses" Virology 233:19-42) 
and was also used to predict early genes of MsEPV (Afonso, C.L., E.R. Tulman, Z. Lu, E. 
Oma, G.F. Kutish, and D.L. Rock [1999] 'The genome of Melanoplus sanguinipes 

20 Entomopoxvirus" J. Virol. 73:533-552). These motifs have been found upsteam of known 

EPV early genes such as the TK gene (Gruidl, M. E., Hall, R. L., and Moyer, R. W. [1992] 
"Mapping and molecular characterization of a functional thymidine kinase from Amsacta 
moorei entomopoxvirus" Virology 186:507-516; Lytvyn, V., Fortin, Y., Banville, M., Arif, 
B., and Richardson, C. [1992] "Comparison of the thymidine kinase genes from three 

25 entomopoxviruses" J. Gen. Virol 73:3235-3240), CbEPV DNA polymerase (Mustafa, A. 

and Yuen, L. [1991] "Identification and sequencing of the Choristoneura biennis 
entomopoxvirus DNA poly merase gene" DNA Seq. 2:39-45), and the MmEPV fusolin gene 
(Li, Y., Hall, R. L., and Moyer, R. W. [1997] "Transient, nonlethal expression of genes in 
vertebrate cells by recombinant entomopoxviruses"./ Virol. 71:9557-9562; Gauthier, L., 
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Cousserans, F., Veyrunes, J. C, and Bergoin, M. [1995] "The Melolontha melolontha 
entomopoxvirus (MmEPV) fusolin is related to the fusolins of lepidopteran EPVs and to the 
37K baculovirus glycoprotein 7 Virology 208:427-436). Out of the 36 early MsEPV gene 
homologs in AmEPV, 27 contain predicted early promoters. For comparison with vertebrate 
5 poxvirus homologs, we adopted the most recently published mammalian poxvirus promoter 

sequence predictions (Cameron, C, Hota-Mitchell, S., Chen, L., Barrett, J., Cao, J. X., 
Macaulay, C Wilier, D., Evans, D., and McFadden, G. [1999] "The complete DNA 
sequence of myxoma virus" Virology 264:298-3 1 8; Wilier, D. O., McFadden, G., and Evans, 
D. H. [1999] 'The complete genome sequence of shope (Rabbit) fibroma virus" Virology 

10 264:3 19-343), which emanated from the earlier work of Davison and Moss (Davison, A. J. 

and Moss, B. [1989] "Structure of vaccinia virus late promoters" J. Mol. Biol 210:771-784). 
Out of the 1 6 early MYX homologs in AmEPV, 1 1 contained early promoter elements. As 
a first approximation, candidate genes were considered early only if they contained the 
vaccinia virus early transcription termination sequence (TTTTTNT) near the 3' end of the 

15 gene; (Gruidl, M. E., Hall, R. L., and Moyer, R. W. [1992] ''Mapping and molecular 

characterization of a functional thymidine kinase from Amsacta moorei entomopoxvirus" 
Virology 186:507-516; Li, X., Barrett, J. W., Yuen, L., and Arif, B. M. [1997] "Cloning, 
sequencing and transcriptional analysis of the Choristoneura fumiferana entomopoxvirus 
spheroidin gene" Virus Res. 47:143-154; Sriskantha, A., Osborne, R. J., and Dall, D. J. 

20 [ 1 997] "Mapping of the Heliothis armigera entomopoxvirus (HaEPV) genome, and analysis 

of genes encoding the HaEPV spheroidin and nucleoside triphosphate phosphohydrolase I 
proteins" J Gen Virol 78:31 15-3123; Yuen, L. and Moss, B. [1987] "Oligonucleotide 
sequence signaling transcriptional termination of vaccinia virus early genes" Proc. Natl. 
Acad. Sci. U.S.A. 84:6417-6421). There are examples of early genes which contain the 

25 TTTTTNT motif towards the 5' N-terminal portion of the ORF such as the DNA polymerase 

of myxoma (Cameron, C, Hota-Mitchell, S.. Chen, L., Barrett, J., Cao, J. X., Macaulay, C, 
Wilier, D., Evans, D., and McFadden, G. [1999] "The complete DNA sequence of myxoma 
virus" Virology 264:298-3 1 8) and Shope fibroma (Wilier, D. O., McFadden, G., and Evans, 
D. H. [1999] "The complete genome sequence of shope (Rabbit) fibroma virus" Virology 
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264:3 19-343) viruses. Therefore, our estimates of early genes may be low. This sequence 
is found within 100 bases 3' of the ORF of 42 of the 116 predicted early AmEPV genes. 

AmEPV ORFs that contained the sequence TAAATG at the translational start site 
were considered late genes (L) (Bertholet, C, Stocco, P., Van Meir, E., and Wittek, R. 
5 [ 1 986] ''Functional analysis of the 5' flanking sequence of a vaccinia virus late gene" EMBO 

,/ 5:1951-1957; Rosef J. and Moss, B. [1985] "Transcriptional and translational mapping 
and nucleotide sequence analysis of a vaccinia virus gene encoding the precursor of the 
major core polypeptide 4b" J. Virol 56:830-838; Weir, J. P. and Moss, B. [1984] 
"Regulation of expression and nucleotide sequence of a late vaccinia virus gene" J. Virol. 

10 51:662-669; Davison, A. J. and Moss, B. [1989] "Structure of vaccinia virus late promoters" 

./. Mol. Biol. 210:771-784). This late promoter consensus sequence has been observed in 
other EPV late genes such as spheroidin, the nucleoside triphosphatase (NTPase), and 
hydrolase I (NPH-I),and topoisomerase(Hall, R. L., Li, Y., Feller, J. A., and Moyer, R. W. 
[1996] "The Amsacta moorei entomopoxvirus spheroidin gene is improperly transcribed in 

15 vertebrate poxviruses" Virology 224:427-436; Hall, R. L. and Moyer, R. W. [1991] 

"Identification,cloning, and sequencing of a fragment of Amsacta moorei entomopoxvirus 
DNA containing the spheroidin gene and three vaccinia virus-related open reading frames" 
J. Virol 65:6516-6527; Li, X., Barrett, J. W., Yuen. L., and Arif, B. M. [1997] "Cloning, 
sequencing and transcriptional analysis of the Choristoneura fumiferana entomopoxvirus 

20 spheroidin gene" Virus Res. 47:143-154; Sanz, P., Veyrunes, J. C, Cousserans, F.. and 

Bergoin, M. [1 994] "Cloning and sequencing of the spherulingene, the occlusion body major 
polypeptide of the Melolontha melolontha entomopoxvirus (MmEPV)" Virology 202:449- 
457; Sriskantha, A., Osborne, R. J., and Dall, D. J. [1997] "Mapping of the Heliothis 
armigera entomopoxvirus (HaEPV) genome, and analysis of genes encoding the HaEPV 

25 spheroidin and nucleoside triphosphate phosphohydrolase I proteins" J Gen Virol 78:31 15- 

3 1 23). Genes that contained the sequences TAAAT or TAAAAT within 1 00 bases upstream 
from their start codon were also potentially considered late genes (L?) (Table 1). These 
sequences have been found upstream of other late vertebrate poxvirus genes (Roseman, N. 
A. and Hruby, D. E. [1987] "Nucleotide sequence and transcript organization of a region of 
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the vaccinia virus genome which encodes a constitutively expressed gene required for DNA 
replication'./ Virol. 61:1398-1406). 

Terminal regions 

5 AmEPV is one of the few entompoxviruses which can be easily and reliably 

replicated in tissue culture (Winter, J., Hall, R. L., and Moyer, R. W. [1995] 'The effect of 
inhibitors on the growth of the entomopoxvirus from Amsacta moorei in Lymantria dispar 
(gypsy moth) cells" Virology 211 : 462-473; Hall, R. L., Li, Y., Feller, J. A., and Moyer, R. 
W. [1996] 'The Amsacta moorei entomopoxvirus spheroidin gene is improperly transcribed 

10 in vertebrate poxviruses" Virology 224:427-436). Because of this, we were able to obtain 

DNA for sequencing from a single clonal virus plaque, thus minimizing template 
heterogeneity. The results of sequencing from non-clonally isolated template DNA can be 
seen in the resultant sequence of MsEPV, where the two inverted terminal repeat (ITR) 
regions are not identical (Afonso, C. L., Tulman, E. R., Lu, Z., Oma, E., Kutish, G. F., and 

1 5 Rock, D. L. [ 1 999] "The genome of Melanoplus sanguinipes Entomopoxvirus" J. Virol. 73 , 

533-552). 

Poxvirus ITRs can vary considerably in size. The smallest ITRs are those of variola 
Bangladesh which are only 725 bp (Massung, R. F., Esposito, J. J.. Liu, L. L, Qi, J., 
Utterback, T. R., Knight, J. C, Aubin, L., Yuran, T. E., Parsons, J. M., Loparev, V. N., 

20 Selivanov, N. A., Cavallaro, K. F., Kerlavage, A. R., Mahy, B. W. J., and Venter, J. C. 

[1993] "Potential virulence determinants in terminal regions of variola smallpox virus 
genome" Nature 366:748-75 1; Massung, R. F., Liu, L. L, Qi, J., Knight, J. C, Yuran, T. E., 
Kerlavage, A. R., Parsons, J. M., Venter, J. C and Esposito, J. J. [1994] "Analysis of the 
complete genome of smallpox variola major virus strain Bangladesh- 1 975 v Virology 

25 201:21 5-240). The AmEPV genome contains identical ITR tandem repeats of 9.4 kbp at both 

termini which are organized in a fashion similar to that of other poxviruses; i.e. a series of 
tandemly repeated sequences interspersed with non-repetitive spacer region (Figure 1) 
(Massung, R. F., Knight, J. C, and Esposito, J. J. [1995] "Topography of variola smallpox 
virus inverted terminal repeats" Virology 21 1 :350-355; Wittek, R., Menna, A., Muller, H. 
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K., Schumperli, D., Boseley, P. G., and Wyler, R. [ 1 978] "Inverted terminal repeats in rabbit 
poxvirus and vaccinia virus DNA"./ Virol. 28:171-181 ;Upton,C. and McFadden,G. [1986J 
"DNA sequence homology between the terminal inverted repeats of Shope fibroma virus and 
an endogenous cellular plasmid species" Mol Cell Biol 6 , 265-276; AfonsoX. L., Tulman, 
5 E. R., Lu, Z„ Oma, E., Kutish, G. F., and Rock, D. L. [1999] "The genome of Melanoplus 

sanguinipes Entomopoxvirus" ./. Virol 73, 533-552). Myxoma virus and AmEPV share a 
similar ITR structure, in that the ORFs encoded in this region extend to the very ends of the 
genome termini, and contain very little non-coding DNA (Cameron, C, Hota-Mitchell, S., 
Chen, L., Barrett, J., Cao, J. X., Macaulay, C, Wilier, D., Evans. D., and McFadden, G. 

10 [1999] 'The complete DNA sequence of myxoma virus" Virology 264:298-318). Other 

sequenced poxvirus ITRs contain smaller numbers of genes interspersed with large regions 
of non-coding DNA within them. Other examples of poxvirus ITRs include MsEPV, which 
contains 3kb of non-coding DNA within the ITRs (Afonso, C. L., Tulman, E. R., Lu, Z., 
Oma, E., Kutish, G. F., and Rock, D. L. [1999] 'The genome of Melanoplus sanguinipes 

15 Entomopoxvirus" J. Virol 73, 533-552), 6kb in VV Ankara strain (Antoine,G., Scheiflinger, 

F., Dorner, F., and Falkner, F. G. [1 998] "The complete genomic sequence of the modified 
vaccinia Ankara strain: Comparison with other orthopoxviruses" Virology 244:365-396) and 
3kb in MCV (Senkevich, T. G., Koonin, E. V., Bugert, J. J., Darai, G., and Moss, B. [ 1 997] 
"The genome of molluscum contagiosum virus: Analysis and comparison with other 

20 poxviruses" Virology 233:19-42). As seen in Table 1 , each of the ORFs within the AmEPV 

ITR encodes a protein with no BLAST-derived function. The exception to this is 
AMVITR1 . This most terminal gene has homology to MsEPV MSV010. Although this 
gene is not within the ITR region of the MsEPV genome, it is located towards the left 
terminus. The gene encodes a member of the leucine rich gene family protein. 

25 Spontaneous DNA arrangements occur with an increased frequency at or near the 

terminal inverted repeat sequences of poxviral genomes (Moyer, R. W., Graves, R. L., and 
Rothe, C. T. [1980] 'The white pock (mu) mutants of rabbit poxvirus. III. Terminal DNA 
sequence duplication and transposition in rabbit poxvirus" Cell 22:545-553). Indeed, the 
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majority of novel and non-essential genes are generally found within poxviral ITRs or 
toward the genomic termini. 

AmEPV and vertebrate poxvirus gene homologs 
5 The complete genomic sequences of vaccinia and variola viruses from the 

orthopoxvirus genus, myxoma and Shope fibroma viruses from the leporipoxvirus genus, 
fowlpox from the avipoxvirus genus, the molluscipoxvirusmolluscum contagiosum and the 
genus B EP V, MsEPV have allowed definition of conserved poxvirus genes present in most, 
if not all, poxviruses. Inclusion of the AmEPV genomic sequence extends that concept. 

10 Inspection of the AmEPV sequence shows 52 ORFs which have homology to genes 

found in ChPV (Table 2). Of these, 44 have been assigned a function. Of the 44 ORFs with 
an assigned function, 18 are derived from proteins involved in mRNA synthesis which 
include 5 ORFs comprising an RNA polymerase, 4 ORFs likely to encode transcription 
factors, 3 ORFs related to helicases/NTPases and 5 ORFs devoted to post-transcriptional 

15 mRNA modifications. Of the ORFs devoted to mRNA modification, the poly (A) 

polymerase deserves special mention. Normally, this heterodimeric enzyme consists of a 
large and small subunit. However, the AmEPV sequence reveals the presence of three rather 
than the expected two potential subunits. The ORFs AMV038, AMV060 and AM VI 15 are 
predicted to represent one large and two small poly(A) polymerase subunits respectively. 

20 This unusual feature will be discussed in a subsequent section. There are 7 homologous 

ORFs involved in functions of DNA replication/repair, which include a DNA polymerase, 
photolyase, nucleotide phosphohydrolase, DNA topoisomerase and a uracil DNA 
glycosylase. Interestingly, neither AmEPV nor MsEPV encode a homolog of the vaccinia 
I3L protein. The I3L protein is a DNA binding protein and is presumably involved in DNA 

25 replication (Davis, R. E. and Mathews, C. K. [1993] "Acidic C terminus of vaccinia virus 

DNA-binding protein interacts with ribonucleotide reductase" Proc. Natl Acad. Sci. U.S.A. 
90:745-749). Ten ChPV/EPV ORFs are associated with conserved virus structural proteins. 
Finally, there are 8 ORFS associated with enzymatic activities not strictly related to nucleic 
acid metabolism. 
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Table 2. Chordopoxvirus homologs found within AmEPV. 



AmEPV 
ORF 


Length 

amino 
acids 


MsEPV 
ORF 


Length 

amino 
acids 


W 
ORF 


Length 

amino 
acids 


Gene name 
and/or function 


Transcription/RNA Modification 


RNA polymerase 


AMV051 


349 


149 


348 


A29L 


305 


RP035 


AMV054 


822 


119 


807 


H4L 


795 


RAP94 


AMV066 


1196 


155 


1190 


A24R 


1164 


RP0132 


AMV166 


237 


100 


230 


A5R 


164 


RP019 


AMV221 


1301 


43 


1319 


J6R 


1286 


RP0147 


AMV230 


179 


245 


186 


D7R 


161 


RP018 


Transcription Factors 


AMV047 


259 


187 


261 


A1L 


150 


VLTF-2 


AMV091 


356 


52 


345 


A23R 


382 


transcription factor 


AMV105 


767 


63 


760 


A7L 


710 


VETF-L 


AMV174 


670 


113 


674 


D6R 


637 


VETF-s 


AMV205 


228 


65 


218 


A2L 


224 


VLTF-3 


NTPase/helicase 


AMV059 


469 


148 


471 


A18R 


493 


DNA helicase 


AMV081 


720 


86 


717 


I8R 


676 


RNA helicase 


AMV192 


648 


53 


647 


D11L 


631 


NPH-I 


mRNA modification 


AMV038 


573 


143 


571 


E1L 


479 


PAP-L 


AMV060 


295 


41 


293 


J3R 


333 


PAP-S 


AMV093 


262 


124 


267 


D12L 


287 


mRNA capping small subunit 


AMV115 


293 


41 


295 


J3R 


333 


PAP-S 


AMV135 


627 


67 


860 


D1R 


844 


mRNA capping large subunit 


DNA replication/repair 


AMV016 


182 


N/A 


N/A 


J2R 


177 


Thymidine Kinase 


AMV025 


453 


235 


466 


S127L* 


445 


CPD photolyase 


AMV050 


1105 


36 


964 


E9L 


1006 


DNA polymerase 


AMV052 


333 


130 


328 


H6R 


314 


DNA topoisomerase 


AMV058 


276 


150 


289 


D10R 


248 


NTP pyrophosphohydorlase/ 
mutT 


AMV087 


726 


89 


834 


D5R 


785 


NTPase 


AMV231 


344 


208 


232 


D4R 


218 


uracil DNA glycosylase UNG 


Structural 


AMV035 


336 


121 


333 


G9R 


340 


membrane protein 


AMV061 


255 


158 


293 


L4R 


251 


30K virion protein 


AMV118 


386 


90 


380 


A16L 


378 


membrane protein 


AMV122 


567 


69 


584 


D13L 


551 


rifampicin resistance gene 


AMV139 


1149 


152 


1306 


A10L 


891 


P4a core protein 
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15 



20 



AmEPV 
ORF 


Length 

amino 
acids 


MsEPV 
ORF 


Length 

amino 
acids 


W 
ORF 


Length 

amino 
acids 


Gene name 
and/or function 


AMV147 


668 


164 


648 


A3L 


644 


P4b core protein 


AMV181 


464 


189 


464 


I7L 


423 


core protein 


AMV217 


246 


183 


242 


L1R 


250 


myristylated membrane protein 


AMV232 


140 


142 


139 


J5L 


133 


membrane protein 


AMV243 


248 


94 


241 


F9L 


212 


membrane protein 


Enzymes 


AMV078 


165 


N/A 


N/A 


S069L* 


173 


protein tyrosine phosphatase 


AMV114 


105 


93 


107 


E10R 


95 


put. redox 


AMV133 


287 


48 


288 


M5L** 


75 


lipase 


AMV150 


240 


171 


244 


A32L 


300 


ATP/GTP binding protein 


AMV153 


468 


173 


457 


F10L 


439 


Ser/Thr protein kinase 


AMV197 


299 


154 


396 


B1R 


300 


Ser/Thr protein kinase 


AMV256 


609 


56 


629 


G1L 


591 


metalloprotease 


AMV255 


152 


N/A 


N/A 


A45R 


163 


Cu-Zn superoxide dismutase 


Others 


AMV041 


213 


39 


193 


G6R 


165 




AMV069 


348 


180 


343 


L3L 


350 




AMV127 


195 


60 


194 


H2R 


189 




AMV138 


320 


151 


313 


A11R 


318 




AMV162 


163 


106 


163 


A22R 


176 




AMV179 


416 


115 


505 


GSR 


434 




AMV186 


161 


132 


161 


A28L 


146 




AMV249 


112 


209 


113 


A21L 


117 





25 All ChPV homolog ORFs shown are from VV. Where no homolog exists, *SFW and 

**CPV. 

Several of the genes included in Table 2 showing AmEPV vertebrate poxvirus 
homologs are not universally conserved, but are nevertheless present in many poxviruses. 
One example is the thymidine kinase (TK) gene. AmEPV encodes a TK gene, as do most 

30 ChPV and most other genus B EPVs investigated to date (Lytvyn, V., Fortin, Y., Banville, 

M., Arif, B., and Richardson, C. [1992] "Comparison of the thymidine kinase genes from 
three entomopoxviruses"./ Gen. Virol. 73:3235-3240). However, the gene is noticeably 
absent from both molluscum contagiosum and MsEPV (Senkevich, T. G., Koonin, E. V., 
Bugert, J. J., Darai,G., and Moss, B. [1997] "The genome of molluscum contagiosum virus: 

35 Analysis and comparison with other poxviruses" Virology 233:19-42; Afonso, C. L., 

Tulman, E. R., Lu, Z., Oma, E., Kutish. G. F., and Rock, D. L. [1999] "The genome of 
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Melanoplus sanguinipes Entomopoxvirus" J. Virol. 73, 533-552). Perhaps as previously 
suggested in the case of MsEPV, the absence of a TK and other enzymes related to 
nucleotide biosynthesis is reflective of a differential dependence on host biosynthetic 
pathways (Afonso et al [1999] supra). Similarly, the CPD photolyase is not universally 
5 conserved within all members of the poxvirus family, but is present in a number of different 

viruses. Also of note is MsEPV ORF237, which is homologous to vaccinia virus B2R 
(Afonso et al. [1999] supra; Goebel, S. J., Johnson, G. P., Perkus, M. E., Davis, S. W., 
Winslow, J. P., and Paoletti, E. [1990] 'The complete DNA sequence of vaccinia virus" 
Virology 179, 247-66, 517-63). This ORF, found at the right termini of both viruses, is 

10 absent from the genome of AmEPV. Likewise, a Cu/Zn superoxide dismutase (SOD) found 

within AmEPV (AMV255) is absent from the genome of MsEPV, and is fragmented or 
partially deleted in many orthopoxviruses (Smith, G. L., Chan, Y. S., and Howard, S. T. 
[1991] "Nucleotide sequence of 42 kbp of vaccinia virus strain WR from near the right 
inverted terminal repeat' V. Gen. Virol 72:1349-1376;Cameron,C., Hota-MitchelfS., Chen, 

15 L., Barrett, J., Cao, J. X., Macaulay, C, Wilier, D., Evans, D., and McFadden, G. [1999] 

"The complete DNA sequence of myxoma virus" Virology 264:298-3 1 8). Both EPVs also 
encode a homolog of the A2 1 L protein of V V. The A2 1 L protein has been shown to interact 
with the A6L protein using the two hybrid system (McCraith, S., Holtzman, T., Moss, B., 
and Fields, S. [2000] "Genome-wide analysis of vaccinia virus protein-protein interactions" 

20 Proc. Natl Acad. Sci. U.S.A 97:4879-4884). Interestingly, there is no homolog of the A6L 

protein in either EPV. Therefore, with the exception of the TK, SOD, protein tyrosine 
phosphatase and VV B2R, AmEPV and MsEPV share the same suite of ChPV virus 
homologs. 

25 A comparison of ORF content between AmEPV and other EPVs 

As well as the core poxviral genes shared between ChPV and EPV shown in Table 
2, there are a number of genes which are shared between sequenced entomopoxviruses ie. 
AmEPV and MsEPV. Limited sequence data is also available from various regions of other 
entomopoxviruses currently under investigation. Given the vastly differing host 
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requirements of the ChPVs and EPVs, it is not unexpected that many genes differ between 
the two subfamilies. Approximately one third of genes encoded by ChPVs are responsible 
for a response against host immune defense systems (Gooding, L. R. [1992] "Virus proteins 
that counteract host immune defenses" Cell 71:5-7; Smith. G. L. [1994] kw Virus strategies for 
5 evasion of the host response to infection" Trends in Microbiol. 2:81-88; Smith, G. L. [2000] 

"Secreted poxvirus proteins that interact with the immune system" Effects of Microbes on 
the Immune System 491-507). The 69 genes shared between MsEPV and AmEPV (but 
absent in ChPVs) are likely involved in insect specific interactions. The pattern of gene 
organization within the genome of Genus B EPVs has long been realized to be distinct from 

10 those of the ChPVs (Hall, R. L. and Moyer, R. W. [1991] "Identification, cloning, and 

sequencing of a fragment of Amsacta moorei entomopoxvirus DNA containing the 
spheroidin gene and three vaccinia virus-related open reading frames" J. Virol 65:6516- 
6527; Sriskantha et ai [1997] supra; Afonso et al. [1999] supra). However it is now also 
evident that within genus B, obvious reorganization has occurred. For example, the NPH-1 

15 and spheroidin homologs are immediately adjacent in all other known genus B viruses from 

Choristoneura and Heliothis* but are separated by 20kb in MsEPV. Similarly, the 
juxtaposed A23R protein and NPH-1 homolog in MsEPV are separated by 78kb in the 
AmEPV genome. Although there are no areas of organizational identity between the MsEPV 
and AmEPV genomes, there is one region of AmEPV genes (AMV159-AMV164) which 

20 contains homologs to MsEPV genes in the order of MSV1 11, 110, 108, 106, 112, 107. 

Given the lack of spatial conservation and degree of gene shuffling between MsEPV and 
AmEPV genes in all other areas of the AmEPV genome, small groups of genes may be 
present as the last remnants of divergence from a common ancestor. Alternatively, small 
clusters might have remained in close proximity to each other due to a more recent 

25 acquisition or for functional or regulatory reasons. Albeit not as striking an example, the 

homologs of MsEPV genes MSV085-MSV089 are also non-sequentially grouped within a 
9 gene assembly within AmEPV (AMV079-AMV087). For these reasons, it is likely a 
conserved colinear core of genes may only be shared among the lepidopteran viruses within 
EPV genus B (Afonso et ai [1999] supra). 
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In this regard, the comparative alignment of two lepidopteran group B viruses, 
AmEPV and published HaEPV genes, does reveal some organizational similarities. Certain 
co-linear regions do appear to be shared. Positions of the spheroidin, NPH-1, "Q3" and 
DNA polymerases are all similarly situated within the genomes of the two viruses. The 
5 juxtaposed HaEPV PAP2, 30K and ORF4 genes are also immediately adjacent and co-linear 

in AmEPV, with ORF direction preserved (AMV060, AMV061 and AMV062)(Crnov and 
Dall 1999). Comparative alignments have also highlighted differences between these two 
more closely related genus B lepidopteran EPVs. For example, the large RNA polymerase 
of HaEPV is located toward the leftmost end of the genome, whereas it is positioned at the 

10 right end of AmEPV. Likewise the "17K" ORF of HaEPV is duplicated and teminally 

located within its ITRs, but homologous regions within AmEPV are not repeated, and are 
positioned approximately one hundred genes from the genomic termini. Whether or not a 
generally co-linear arrangement of genes emerges for the lepidopteran EPVs, it is obvious 
that EPVs in general have not followed the evolutionary direction of ChPV which has 

15 enabled them to retain a common co-linear gene core. 

Clearly, genes shared between AmEPV and MsEPV are not arranged in a co-linear 
fashion and based on overall gene organization, MsEPV and AmEPV may be far more 
distantly related than the current common morphologically based classification as genus B 
EPVs would suggest. There are two possibilities to explain this divergence in gene order 

20 between AmEPV (lepidopteron) and MsEPV (orthopteran) viruses. One model employs a 

large evolutionary gap between the two viruses. A second model is based on intrinsic 
genomic plasticity and generalized movement of genes within the viral chromosome of 
EPVs. Comparative homologies among essential genes; e.g. RNA polymerase subunits, 
suggests MsEPV and AmEPV are more closely related to each other than either is to ChPVs 

25 homologs. Therefore, it may well be that plasticity or position independent location of genes 

within EPVs plays a significant role in the creation of divergent gene orders. 

AmEPV encodes an additional 27 gene homologs not found w ithin ChPV or MsEPV, 
but which are present in other insect viruses including baculoviruses (AcNPV, XcGV, 
SeNPV, CpGV, LdNPV and TnGV) and an iridovirus(Chilo iridescent virus). The majority 
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of these genes have previously been assigned functions, and a number are not specific to 
insect viruses alone (see Table 1 ). 

AmEPV gene families 

MsEPV was found to encode 43 novel ORFs which could be grouped into five gene 
families of varying stringency. Examination of the AmEPV genomic sequence revealed the 
presence of 23 genes which can be grouped into six gene families (Table 3). 



Table 3. AmEPV Gene Families. 
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The AMV 1 76 gene family has no homology to any proteins within current databases. 
Each of these 86 residue proteins is identical, except for a single nucleotide substitution in 
AMV056 which results in an isoleucinecodon at residue 37, instead of the leucine coded by 
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both AMV176 and AMV178. It is unusual to observe perfect copies of genes within a gene 
family. All members of the family are predicted to contain a transmembrane domain. 

The five member ALI-like (alanine-leucine-isoleucine) gene family largely 
comprises ORFs related to the AMV 1 76 gene family discussed above. The ORFs do not 
5 possess any motifs indicative of transmembrane domains or signal sequences. AMV055, 

appears to be a carboxy terminal truncated member of this family. This 133 residue ORF 
shares a large number of residue identities with the other family members. The final member 
of the family, AMV257, appears to be truncated at the N-terminus. and is less related to the 
other members of this family. Nevertheless, its homology to MS V 1 96 warrants its inclusion 

10 in this group. 

A third MTG-like gene family has three members; AMV 194, AMV207 and 
AMV209. There is a 69% identity between AM V207 and AM V209. AMV 1 94 is somewhat 
less related to the other family members. Each gene was identified independantly based on 
its homology to the MTG gene family ORF MSV198 found in MsEPV. However, the 

15 invariant signature MTG (methionine-threonine-glycine) motif is absent from all AmEPV 

proteins, and an expected internal motif found within the MsEPV proteins was found to be 
degenerate. 

A fourth family comprising only AMV029 and AMV254 shows homology to 
MsEPV ORF MSV027, which is a member of the tryptophan repeat gene family. Both 

20 AmEPV ORFs contain the expected motifs, although AMV029 does show degeneracy. 

The fifth, 17K ORF* gene family, contains five members which do not show any 
homology to MsEPV proteins, but are instead related to the 1 7K ORF of HaEPV. AMV024, 
AMV110 and AMV112 show excellent conservation at both their amino and carboxy 
termini, with a 60 residue internal portion of lesser similarity. Interestingly, these three 

25 genes also show homology with the N 1 R/p28 gene family of FPV (FPV 1 24). AMV 1 00 and 

AMV 1 32 are also homologous to the HaEPV 1 7K ORF, and to FPV248, but less so. There 
is no homology between these two predicted AmEPV proteins themselves. Fifteen residues 
are shared between all members of this family. 
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The sixth gene family is the LRR (leucine-rich repeat) gene family which contains 
five AmEPV genes based upon the position of a motif containing regularly spaced leucine 
residues. There is a large LRR gene family in MsEPV. Each of the five members of the 
AmEPV LRR-like family shows homology to an LRR gene family protein of MsEPV. 
5 AMVITR1 and AMV005 are 63% identical, and very well conserved at their amino 

terminus. AMV014 and AMV134 share regions of homology along their lengths. At 1 17 
residues, AMV076 is significantly smaller than other LRR-like gene family members 
(varying from 350 to 535 residues). However, when aligned with all other family members, 
an internal conserved motif emerges which includes seven leucine or isoleucine residues. 

10 

AmEPV ORFs encoding unique gene products 

The majority of the unique AmEPV genes are located at the terminal extremes of the 
virus genome, as can be easily observed in Figure 7. More than one third of AmEPV ORFs 
(128 out of 279) show no homology to any sequences currently in the databases. We have 
15 classified these novel ORFs on the basis of whether they contain a predicted transmembrane 

domain (TM) and/or signal peptide (SP). Based on this classification, 4 ORFs possess 
predicted TM and SP domains, 3 an SP only, 56 a TM alone, and 65 possess neither. We 
expect that like the ChPV, a number of AmEPV genes are devoted to overcoming host 
defense responses. 

20 Most genes encoded by AmEPV have homologs in ChPVs, EPVs or other insect 

viruses. In addition, there are a number of ORFs unique to AmEPV. There are also ORFs 
of interest potentially involved in host pathogenesis or virulence, such as AMV 133 (SEQ ID 
NO: 1) which encodes a lipase, AMV255 (SEQ ID NO: 2) which encodes a superoxide 
dismutase (SOD), AMV025 (SEQ ID NO: 3) encoding a CPD photolyase and AMV021 

25 (SEQ ID NO: 4) which encodes a baculovirus-like inhibitor of apoptosis (IAP). The 

following paragraphs briefly discuss each of these genes and their expected interactions with 
the host immune system. 

AMV133 (SEQ ID NO: 1) encodes a AmEPV triacylglyceride lipase gene which 
could conceivably function as a virulence gene through lipid hydrolysis. AmEPV has been 
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shown to launch a promiscuous infection within the insect, including the fat body (Arif, B. 
M. and Kurstak,E. [1991]. The Entomopoxviruses.//7 "Viruses of Invertebrates," E. Kurstak, 
Ed.„ pp. 175-195. Marcel Dekker, Inc., New York), which is the major site of lipid storage 
(Chapman, R. F. [1998] Circulatory system, blood and immune systems. In "The Insects" 
5 pp. 94-131, Cambridge University Press, Cambridge). Although AmEPV infected insects 

do not undergo the "melting" phenotype associated with baculovirus infection, lipid 
hydrolysis would be anticipated to increase viral virulence, as has also been suggested for 
the lipase gene of MsEPV (Afonso et al [1999] supra). Ectromelia and CPV are the only 
other poxviruses which encode similar proteins, and these are thought to play a role in the 

10 viral inflammatory response (Wall, E. M., Cao, J. X., Chen, N. H., Buller, R. L., and Upton, 

C. [1997] "A novel poxvirus gene and its human homolog are similar to an E-coli 
Lysophospholipase" Virus Res. 52: 1 57-167). 

AmEPV AMV255 (SEQ ID NO: 2) encodes a Cu + 7Zn~ superoxide dismutase 
homolog. These proteins are widespread in nature and are recognized as a primary defense 

15 against the damage of superoxide radicals (Fridovich, I. [1997] "Superoxide anion radical 

(02-.), superoxide dismutases, and related matters" J. Biol Chern. 272:18515-18517). 
Although the SOD homolog was initially discovered in a baculovirus (Tomalski, M. D., 
Eldridge, R., and Miller, L. K. [1991] "A baculovirus homolog of a Cu/Zn superoxide 
dismutase gene" Virology 184, 149-161), all sequenced ChPVs have also been found to 

20 encode a vestige of a SOD. However, many include deletions or substitutions within the 

coding region which render the protein inactive (Smith, G. L., Chan, Y. S., and Howard, S. 
T. [1991] "Nucleotide sequence of 42 kbp of vaccinia virus strain WR from near the right 
inverted terminal repeat"./ Gen. Virol 72, 1349-1376; Wilier al [ 1 999] s upra) but the 
AmEPV homolog appears to be intact. MsEPV does not encode a sod (Afonso et al [1 999] 

25 supra). 

During their life cycle, most insect viruses spend some period of time exposed to 
potentially detrimental environmental conditions. Therefore it is somewhat surprising that 
more insect viral genomes do not contain light-dependant DNA-repair mechanisms. 
AmEPV AMV025 (SEQ ID NO: 3) and MsEPV both encode a CPD photolyase homolog. 
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as do the ChPV SFV and MYX. These are the only reports of virally encoded CPD 
photolyases. 

Viruses have evolved various strategies to inhibit apoptosis, thereby allowing 
intracellular viral replication. In ChPVs, apoptosis is controlled in part by serpins (Petit, F., 
5 Bergagnoli, S., Gelfi, J., Fassy, F., Boucraut-Baralon, C, and Milon, A. [1996] 

"Characterization of a myxoma virus-encoded serpin-like protein with activity against 
interleukin-lb converting enzyme" J. Virol. 70:5860-5866; Ray, C. A., Black, R. A., 
Kronheim,S. R., Greenstreet,T. A., Sleath,P. R., Salvesen, G. S., and Pickup, D. J. [1992] 
"Viral inhibition of inflammation: cowpox virus encodes an inhibitor of the interleukin-1 

10 beta converting enzyme" Cell 69:597-604; Spriggs, M. K., Hruby, D. E., Maliszeswki, C. 

R., Pickup, D. J., Sims, J. E., Buller, R. M. L., and VanSlyke, J. [1992] "Vaccinia and 
Cowpox viruses encode a novel secreted interleukin-1 binding protein" Cell 71:145-152; 
Ray, C. A. and Pickup, D. J. [1996] "The mode of death of pig kidney cells infected with 
cowpox virus is governed by the expression of the crmA gene" Virology 217:384-391; 

15 Macen, J., Takahashi, A., Moon, K. B., Nathaniel, R., Turner, P. C, and Moyer, R. W. 

[1998] "Activation of caspasesin pig kidney cells infected with wild-type and CrmA/SPI-2 
mutants of cowpox and rabbitpox viruses' V. Virol 72:3524-3533; Turner, P. C. and Moyer, 
R. W. [1998] "Control of apoptosis by poxviruses. Seminars in Virology 8:453-469). Insect 
viruses control apoptosis through either p35 or through a series of inhibitor of apoptosis 

20 (IAP) proteins (Deveraux, Q. L. and Reed, T. C. [1999] "IAP family proteins - suppressors 

of apoptosis" Genes & Development 1 3:239-252; Miller, L. K. [1999] "An exegesis of I APs: 
salvation and surprises from BIR motifs" Trends Cell Biol. 9:323-328; Manji, G. A., Hozak, 
R. R., LaCount,D. J., and Friesen,P. D. [1997] "Baculovirusinhibitorof apoptosis functions 
at or upstream of the apoptotic suppressor P35 to prevent programmed cell death" J. Virol. 

25 7 1 :4509-45 1 6). AM V02 1 (SEQ ID NO: 4) encodes one such inhibitor of apoptosis protein 

(IAP), and contains two typical baculovirus IAP repeats and a C-terminal RING finger motif. 
The AMV021 ORF shows significant identity to the IAP of Cydia pomonella granulosis 
virus (47%), which has previously been shown to be functionally active (Crook, N. E., Clem, 
R. J., and Miller, L. K. [ 1 993] "An apoptosis-inhibitingbaculovirus gene with a zinc finger- 



G SH-APPS Ul -22 1 C IXC I .wpdDNB la 



40 UF-221C1XC1 

like motif V. Virol. 67:2168-2174). AmEPVand MsEPV are the only poxviruses found to 
encode IAPs. These proteins have only been noted in the genomes of viruses which infect 
insect or arthropod hosts. 

The following are several different ORFs encoded by AmEPV, which are notable 
5 either because they currently do not have homologs in any published viral sequence to date 

or possess novel aspects of previously described poxvirus genes. 

AMV060 (SEQ ID NO: 5) and AMV 1 1 5 (SEQ ID NO: 6) encode a first and second 
AmEPV poly(A) polymerase subunit. ORFs AMV060 and AMV1 15 present a completely 
unanticipated variation of a well detailed poxvirus encoded enzyme, the poly(A) polymerase. 

10 The cytoplasmic synthesis of poxvirus mRNAs involves not only transcription of a given 

gene by the viral RNA polymerase, but also post-transcriptional modification of the 
transcripts, including 3' poly(A) addition and 5'capping as well as 2 , 0-methylation. In the 
case of VV, addition of poly(A) to transcripts and 2'0-methylation of the mRNAs involves 
a heterodimeric poly(A) polymerase consisting of one large (VP55) and one single small 

15 (VP39) subunit encoded by two distinct ORFs (BrakeK C. and Kates, J. R. [1974] "Poly(A) 

polymerase from vaccinia virus-infected cells. I. Partial purification and characterization" 
J. Virol. 14:715-723; Gershon, P. D., Ahn, B. Y„ Garfield, M., and Moss, B. [1991] 
"Poly(A) polymerase and a dissociable polyadenylation stimulatory factor encoded by 
vaccinia virus" Cell 66:1269-1278; Schnierle, B. S., Gershon, P. D., and Moss, B. [1992] 

20 "Cap-specific mRNA (nucleoside-02'-)-methyltransferase and poly(A) polymerase 

stimulatory activities of vaccinia virus are mediated by a single protein" Proc. Natl. Acad. 
Sci. U.S.A. 89:2897-2901). 

The AmEPV genomic sequence has revealed an unusual feature of the poly(A) 
polymerase in this entomopoxvirus. Like other poxviruses, there is a single, large subunit 

25 (AMV038) (SEQ ID NO: 1 1 ) of approximately 570 amino acids. This is similar in size to 

the large VV poly(A) polymerase subunit (VP55). However, unlike any other poxvirus 
(Afonso et al. [2000] supra: Afonso e t al. [ 1 999] supra\ Cameron et al. [ 1 999] supra: Wilier 
et al. [1999] supra: Senkevich et al. [1997] supra: Antoine et al. [1998] supra: Goebele/t//. 
[1990] supra), sequencing suggests that AmEPV may encode two small subunits (AMV060 
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and AMV1 15). The two small subunitsare somewhat smaller than the 333 amino acid VV 
small sublimit (295 and 293 amino acids respectively) (Figure 10) and related throughout 
their length. 

Comparison of both AMV060 and AMV 1 1 5 to the small subunit of VV (VP39, ORF 
5 J3R), MsEPV (MSV041) and the sole poly(A) polymerase subunit revealed in the 

incompletely sequenced genome of HaEPV, is striking (Figure 10). Both AmEPV small 
subunits show the largest degrees of relatedness to other poxvirus poly( A) polymerase small 
subunits within the first 200 amino acids. Both AmEPV small subunits contain a highly 
conserved poly(A) polymerase regulatory structural motif encompassing amino acids 1 -28 1 

10 within AMV060 and amino acids 8-271 within AMV1 15. The AMV060 subunit is more 

related to VP39 than is AMV1 15. However, if both the two AmEPV small subunits are both 
compared to the single poly(A) polymerase small subunit of MsEPV (MSV041), the 
homologies for both AMV60 and AMV 1 1 5 to MSV041 comparable and greater than either 
of the small AmEPV subunits to VV. BLAST values for AMV060 showed it to be most 

15 related to the small poly(A) polymerase subunit sequenced from HaEPV (Sriskantha et ai 

[1997] supra)* while AMV1 15 was most homologous to that of MsEPV. 

One highly conserved, ungapped motif (Figure 10) [L/V]-Y-I-G-S-X-X-[G/A]-[Y/T]- 
H-X-X-X-L can be somewhat expanded if comparisons are limited to only EPV sequences. 
Note that there are other completely conserved residues in the centermost region of the 

20 proteins, and many other conservative substitutions. 

Another interesting feature is revealed when one examines the C-terminus of the 
small subunits (Figure 10). One immediately notes that the comparable VV small subunit 
contains a C-terminal extension. The VV 36-43 amino acid C-terminal tail is non-essential 
for activity (Shi, X., Yao, P., Jose, T.„ and Gershon, P. [1996] "Methyltransferase-specific 

25 domains within VP-39, a bifunctional protein that participates in the modification of both 

mRNA ends" RNA 2:88-101 ), and is probably retained because the C-terminal region of the 
VV subunit overlaps the next open reading frame ( J4R) which encodes a 22kDa subunit of 
the VV RNA polymerase (Goebel et al [1990] supra). 
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One functional hypothesis to account for the presence of an additional small poly(A) 
subunit is suggested by the multiple activities of the poly(A) polymerase itself (Gershon, P. 
D., Shi, X. N., and Hodel, A. E. [1998] "Evidence that the RNA methylation and poly(A) 
polymerase stimulatory activities of vaccinia virus protein VP39 do not impinge upon one 
5 another" Virology 246:253-265). VV VP55 catalyzes the initial (-35 base) addition of 3 f 

poly(A) to newly synthesized mRN A or 5' phosphorylated nucleotide primers (Gershon, P. 
D., Ahn, B. Y., Garfield, M., and Moss, B. [1991] 'ToIy(A) polymerase and a dissociable 
polyadenylation stimulatory factor encoded by vaccinia virus" Cell 66:1269-1278). The 
small subunit, VP39 has three activities. The first is to serve as a processivity factor which 

10 in the presence of VP55 extends the poly(A) length to several hundred A residues (Gershon, 

P. D. and Moss, B. [1993] "Stimulation of poly(A) tail elongation by the VP39 subunit of 
the vaccinia virus-encoded poly(A) polymerase" J. Biol. Chem. 268:2203-2210). The 
second, distinct activity, mediated by VP39 alone, is an mRNA cap-specific 2'-0- 
methyltransferase (Schnierle et al. [1992] supra). The third activity is an associated 

15 transcription elongation factor (Latner, D. R.,Xiang, Y„ Lewis, J. I., Condit, J., and Condit 

R. C. [2000] 'The vaccinia virus bifunctional gene J3 ( nucleoside-2 '-0-)-methyltransferase 
and poly(A) polymerase stimulatory factor is implicated as a positive transcription 
elongation factor by two genetic approaches" Virology 269:345-355). It is possible that these 
various activities have been distributed amongst the two subunits. Alternatively, one of the 

20 subunits may have evolved to fulfill an entirely unrelated function. 

AMV050 (SEQ ID NO: 7) and AMV210 (SEQ ID NO: 8) encode AmEPV DNA 
polymerases. In view of our findings with the poly(A) polymerase, we would like to call 
attention to an interesting feature of EPV DNA polymerases first noted in African Swine 
Fever virus (Oliveros, M., Yanez, R. J., Salas, M. L., Salas, J., Vinuela, E., and Blanco, L. 

25 [1997] "Characterizationof an African swine fever virus 20-kDa DNA polymerase involved 

in DNA repair" J. Biol, Chem. 272:30899-30910)and later in MsEPV (Afonso et al. [1999] 
supra), which has also been found in AmEPV. The 1 105 residue AmEPV ORF AMV050 is 
similar in length, and homologous to typical poxvirus encoded DNA polymerases. A second 
smaller (612 amino acids) AmEPV encoded ORF, AM V2 1 0, shares a 460 amino acid region 
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of clear homology with AMV050, although both proteins possess completely unique regions; 
i.e., the N-terminus of AM V050 (residues 1 -645) and C -terminus of AM V2 1 0 (residues 463- 
612). Both proteins have been found to contain DNA polymerase motifs (Table 1 ). 

AMV130 (SEQ ID NO: 9) encodes an AmEPV ABC transporter-like protein. 
5 AMV130 represents the largest ORF in AmEPV. The 1384 residue protein shows homology 

to the ATP-binding cassette (ABC) proteins. These are a large gene family found from 
bacteria to man, and have a variety of functions (van Veen, H. W. and Konings, W. N. 
[1998] "Structure and function of multidrug transporters"^^ Exp. Med. Biol. 456:145- 
158). While most are ATP-driven membrane translocators, some act as ion channels, ion 

10 channel regulators, receptors, proteases, immune regulators and even sensing proteins 

(Bauer, B. E., Wolfger, H., and Kuchler, K. [1999] "Inventory and function of yeast ABC 
proteins: about sex, stress, pleiotropic drug and heavy metal resistance" Biochim. Biophys. 
Acta 1461:217-236;Klein,I.,Sarkadi,B.,and Radi, A. [1999] "An inventory ofthe human 
ABC proteins" Biochim. Biophys. Acta 1461:237-262; Abele, R. and Tampe, R. [1999] 

1 5 "Function of the transport complex TAP in cellular immune recognition" Biochim. Biophys. 

Acta 1461:405-419). All ABC proteins share a common molecular architecture consisting 
of at least one 200-250 amino acid ABC cassette and several predicted a-helical membrane 
spanning segments (TMS or TMD). The minimum structural requirement is considered to 
be 2 ABC and 2 TMD regions, present in either 1 (full transporter) or 2 (half transporter) 

20 polypeptide chains. The AmEPV ABC protein consists of TMD-ABC-TMD-ABC domains, 

one of the structures of active ABC transporters. This arrangement of AMV130 domains is 
also found in the MDR/TAP, MRP, CFTR and ABC1 subfamilies and is associated with 
activities ranging from control of sex (yeast), drug resistance (humans, bacteria), ion 
channels (human CFTR gene) and engulfmentof dead cells (C. elegans) (Bauer et al [1999] 

25 supra; Klein et al. [1999] supra; Abele and Tampe [1999] supra). Each AmEPV TMD 

contains 6 or 7 transmembrane helices (Figure 1 1). No other virus is known to encode an 
ABC transporter. The potential ABC-transporter encoded by AmEPV may play a role in 
evading host immune defenses, e.g. facilitating removal of toxic elements from virally 
infected cells. 
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AMV007 (SEQ ID NO: 10) encodes an AmEPV Kunitz-motif protease inhibitor 
(KPI). AmEPV ORF AMV007 is located near the left end of the AmEPV genome, and 
encodes a small protein of 79 amino acids. A Prosite search revealed the presence of a 
Kunitz family signature (Prosite PS00280), a motif associated with protease inhibitors 
5 ( Figure 12). Indeed, the Kunitz-type pancreatic trypsin inhibitors represent one of the most 

common families of serine protease inhibitors. Kunitz-type inhibitors found within insects 
are typically less than 100 amino acids in length. All contain certain five invariant cysteine 
residues. AMV007 has all five cysteines and the alignment allows prediction of an arginine 
PI. The inducible serine protease inhibitor (ISP-2) of Galleria mellonella (Frobius, A. C, 

10 Kanost, M. R., Gotz, P., and Vilcinskas, A. [2000] "Isolation and characterization of novel 

inducible serine protease inhibitors from larval hemolymph of the greater wax moth Galleria 
mellonella" £wr J. Biochem. 267:2046-2053) and the hemolymph trypsin inhibitors (HLTIs 
A and B) of Manduca sexta (Ramesh, N., Sugumaran, M., and Mole, J. E. [1988] 
"Purification and characterizationof two trypsin inhibitors from the hemolymph of Manduca 

15 sexta larvae" J. Biol Chem. 263:1 1523-1 1527) are both Kunitz-type inhibitors that contain 

PI residues of arginine, and inhibit trypsin-like proteases. Structurally, Kunitz-type 
inhibitors are comprised of short alpha/beta proteins with little secondary structure. 
A 1 though wi despread i n nature, there are no reports of the presence of a Kunitz-type protease 
inhibitor (KPI) from this family in any viral genome. It is interesting to note that vertebrate 

20 poxviruses do encode protease inhibitors, but they are members of a different family (the 

serine protease inhibitor, serpin) family. The vertebrate poxvirus serpins have been shown 
to have an immunoregulatory role in the infected vertebrate host (Turner, S., Kenshole, B., 
and Ruby, J. [1999] "Viral modulation of the host response via crmA/SPI-2 expression" 
Immunology and Cell Biology 77:236-241 ;McFadden, G. [1995] In: Viroceptors, virokines 

25 and related immune modulators encoded by DNA viruses, R. G. Landes/Springer-Verlag, 

Austin, TX; McFadden, G., Graham, K., and Barry, M. [1996] "New Strategies of immune 
modulation by DNA viruses" Transplant. Proc. 28:2085-2088). We propose that the 
AmEPV KPI protein may fulfill a similar immunoregulatoryrole in the infected invertebrate 
host, but may target different pathways than do the serpins which control inflammation, 
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apoptosis and the host immune response (Turner and Moyer [1998] supra; Turner, P. C, 
Musy, P. Y., and Moyer, R. W. [1995] Poxvirus Serpins, In "Viroceptors, Virokines and 
related immune modulators encoded by DNA viruses," G. McFadden, ed., pp. 67-88. R. G. 
Landes, Galveston, TX). 

5 One function the KPI protein may possess is suggested by the physiology of the 

insect host. The haemolymph of insects contains relatively high concentrations of a variety 
of protease inhibitors from several different gene families (Kanost, M. R. [1999] "Serine 
proteinase inhibitors in arthropod immunity" Developmental and Comparative Immunology 
23:291-301; Jiang, H. B. and Kanost, M. R. [1997] "Characterizationand functional analysis 

10 of 12 naturally occurring reactive site variants of serpin-1 from Manduca sexta" J. Biol 

Chem. 272:1082-1087). Protease inhibitors from the Kunitz family have been identified as 
haemolymph proteins from lepidopteran insect species (Sugumaran, M., Saul, S. J., and 
Ramesh, N. [1985] "Endogenous protease inhibitors prevent undesired activation of 
prophenolase in insect hemolymph" Biochem. Biophys. Res. Commun. 132:1124-1129; 

15 Sasaki, T. [1984] "Amino acid sequence of a novel Kunitz-type chymotrysin inhibitor from 

hemolymph of silkworm larvae" Bomhyx moorei. FEBS Lett. 168:230), which function as 
inhibitors of trypsin or chymotrypsin. These host KPI proteins have been shown to be 
important in the avoidance of inopportune chymotrypsin-mediated activation of 
prophenyloxidase (Saul, S. J. and Sugumaran, M. [1986] "Protease inhibitor controls 

20 prophenoloxidase activation in Manduca sexta" FEBS Lett. 208:113-116; Aso, Y., 

Yamashita, T., Meno, K., and Murakami, M. [1994] "Inhibition of prophenoloxidase- 
activating enzyme from Bombyx mori by endogenous chymotrypsin inhibitors" Biochem. 
Mol. Biol Int. 33:751-758). This enzyme is an early component of the cascade required by 
the insect immune system to produce melanin, which is used to engulf and overcome 

25 invading foreign objects (Gillespie, J. P.,Kanost, M. R., and Trenczek,T. [1997] "Biological 

mediators of insect immunity" Annu. Rev. Entomol. 42:61 1-43, 61 1-643; Vilmos, P. and 
Kurucz, E. [1998] "Insect immunity: evolutionary roots of the mammalian innate immune 
system" Immunol. Lett. 62:59-66). Production of such a protein by an infecting virus may 



(i SII-APPS U1-22IC I XC1 wpd DNB la 



46 UF-221C1XC1 

therefore lessen the amount of prophenyloxidase induced by the insect immune system 
during infection. 

Polynucleotides of the subject invention include sequences identified in the attached 
sequence listing as well as the tables and figures and described by open reading frame (ORF) 
5 position within the genome. In addition, the subject invention includes polynucleotides 

which hybridize with other polynucleotides of the subject invention. 

Additional uses of polynucleotides . The polynucleotide sequences exemplified 
herein can be used in a variety of ways, having numerous applications in techniques known 
to those skilled in the art of molecular biology having the instant disclosure. These 
10 techniques include their use as hybridization probes, for chromosome and gene mapping, in 

PCR technologies, and in the production of sense or antisense nucleic acids. 

These polynucleotides can be used in assays for additional polynucleotides and 
additional homologous genes, and can be used in tracking the quantitative and temporal 
expression of these genes in cells and organisms. Polynucleotides of the subject invention 
15 may be used as insertion sites for foreign genes of interest. 

Antisense technology can also be used to interfere with expression of the disclosed 
polynucleotides. For example, the transformation of a cell or organism with the reverse 
complement of a gene encoded by a polynucleotide exemplified herein can result in strand 
co-suppression and silencing or inhibition of a target gene, e.g., one involved in the infection 
20 process. 

Polynucleotides disclosed herein are useful as target genes for the synthesis of 
antisense RNA or dsRNA useful for RNA-mediated gene interference. The ability to 
specifically inhibit gene function in a variety of organisms utilizing antisense RNA or ds 
RNA-mediated interference is well known in the fields of molecular biology (see for 
25 example CP. Hunter, Current Biology [1999] 9:R440-442; Hamilton et al., [1999] Science, 

286:950-952; and S. W. Ding, Current Opinions in Biotechnology [2000] 1 1:152-156, hereby 
incorporated by reference in their entireties). dsRNA (RNAi) typically comprises a 
polynucleotide sequence identical or homologous to a target gene (or fragment thereof) 
linked directly, or indirectly, to a polynucleotide sequence complementary to the sequence 
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of the target gene (or fragment thereof). The dsRNA may comprise a polynucleotide linker 
sequence of sufficient length to allow for the two polynucleotide sequences to fold over and 
hybridize to each other; however, a linker sequence is not necessary. The linker sequence 
is designed to separate the antisense and sense strands of RNAi significantly enough to limit 
5 the effects of steric hindrances and allow for the formation of dsRNA molecules and should 

not hybridize with sequences within the hybridizing portions of the dsRNA molecule. The 
specificity of this gene silencing mechanism appears to be extremely high, blocking 
expression only of targeted genes, while leaving other genes unaffected. Accordingly, one 
method for controlling gene expression according to the subject invention provides materials 
10 and methods using double-stranded interfering RNA (dsRNAi), or RNA-mediated 

interference (RNAi). The terms dsRNAi and RNAi are used interchangeably herein unless 
otherwise noted. 

RNA containing a nucleotide sequence identical to a fragment of the target gene is 
preferred for inhibition; however, RNA sequences with insertions, deletions, and point 

15 mutations relative to the target sequence can also be used for inhibition. Sequence identity 

may optimized by sequence comparison and alignment algorithms known in the art (see 
Gribskov and Devereux, Sequence Analysis Primer , Stockton Press, 1991, and references 
cited therein) and calculating the percent difference between the nucleotide sequences by, 
for example, the Smith- Waterman algorithm as implemented in the BESTFIT software 

20 program using default parameters (e.g., University of Wisconsin Genetic Computing Group). 

Alternatively, the duplex region of the RNA may be defined functionally as a nucleotide 
sequence that is capable of hybridizing with a fragment of the target gene transcript. 

RNA may be synthesized either in vivo or in vitro. Endogenous RNA polymerase 
of the cell may mediate transcription in vivo* or cloned RNA polymerase can be used for 

25 transcription in vivo or in vitro. For transcription from a transgene in vivo or an expression 

construct, a regulatory region (e.g., promoter, enhancer, silencer, splice donor and acceptor, 
polyadenylation)may be used to transcribe the RNA strand (or strands); the promoters may 
be known inducible promoters such as baculovirus. Inhibition may be targeted by specific 
transcription in an organ, tissue, or cell type. The RNA strands may or may not be 
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polyadenylated; the RNA strands may or may not be capable of being translated into a 
polypeptide by a cell's translational apparatus. RNA may be chemically or enzymatically 
synthesized by manual or automated reactions. The RNA may be synthesized by a cellular 
RNA polymerase or a bacteriophage RNA polymerase (e T3, T7, SP6). The use and 
5 production of an expression construct are known in the art (see, for example, WO 97/320 1 6; 

U.S. Pat. Nos. 5,593,874; 5,698,425; 5,712,135; 5,789,214; and 5,804,693; and the 
references cited therein). If synthesized chemically or by in vitro enzymatic synthesis, the 
RNA may be purified prior to introduction into the cell. For example, RNA can be purified 
from a mixture by extraction with a solvent or resin, precipitation, electrophoresis, 

10 chromatography, or a combination thereof. Alternatively, the RNA may be used with no or 

a minimum of purification to avoid losses due to sample processing. The RNA may be dried 
for storage or dissolved in an aqueous solution. The solution may contain buffers or salts to 
promote annealing, and/or stabilization of the duplex strands. 

Preferably and most conveniently, dsRNAi can be targeted to an entire 

15 polynucleotide sequence set forth herein. Preferred RN Ai molecules of the instant invention 

are highly homologous or identical to the polynucleotides of the sequence listing. The 
homology may be greater than 70%, preferably greater than 80%, more preferably greater 
than 90% and is most preferably greater than 95%. 

Fragments of genes can also be utilized for targeted suppression of gene expression. 

20 These fragments are typically in the approximate size range of about 20 nucleotides. Thus, 

targeted fragments are preferably at least about 15 nucleotides. In certain embodiments, the 
gene fragment targeted by the RNAi molecule is about 20-25 nucleotides in length. In a 
more preferred embodiment, the gene fragments are at least about 25 nucleotides in length. 
In an even more preferred embodiment, the gene fragments are at least 50 nucleotides in 

25 length. 

Thus, RNAi molecules of the subject invention are not limited to those that are 
targeted to the full-length polynucleotide or gene. Gene product can be inhibited with a 
RNAi molecule that is targeted to a portion or fragment of the exemplified polynucleotides; 
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high homology (90-95%) or greater identity is also preferred, but not necessarily essential, 
for such applications. 

In another aspect of the invention, the dsRNA molecules of the invention may be 
introduced into cells with single stranded (ss) RN A molecules which are sense or anti-sense 
5 RNA derived from the nucleotide sequences disclosed herein. Methods of introducing 

ssRNA and dsRNA molecules into cells are well-known to the skilled artisan and includes 
transcription of plasmids, vectors, or genetic constructs encoding the ssRNA or dsRNA 
molecules according to this aspect of the invention; electroporation,biolistics,or other well- 
known methods of introducing nucleic acids into cells may also be used to introduce the 

10 ssRNA and dsRNA molecules of this invention into cells. 

Other aspects of the invention include use of the disclosed sequences or recombinant 
nucleic acids derived therefrom to produce purified peptides. The nucleotide sequences as 
disclosed herein may be used to produce an amino acid sequence using well known methods 
of recombinant DNA technology. Goeddel (Gene Expression Technology, Methods and 

15 Enzymology [1990] Vol 185, Academic Press, San Diego, CA) is one among many 

publications which teach expression of an isolated, purified nucleotide sequence. The amino 
acid or peptide may be expressed in a variety of host cells, either prokaryotic or eukaryotic. 
Host cells may be from the same species from which the nucleotide sequence was derived 
or from a different species. 

20 Still further aspects of the invention use these purified peptides to produce antibodies 

or other molecules able to bind to the peptides. These antibodies or binding agents can then 
be used for the screening of cells in order to localize the cellular distribution of the peptides 
or proteins. The antibodies are also useful for the affinity purification of recombinantly 
produced peptides or proteins. 

25 The disclosed nucleotide sequences can be used individually, or in panels, in tests or 

assays to detect levels of peptide, polypeptide, or protein expression. The form of such 
qualitative or quantitative methods may include northern analysis, dot blot or other 
membrane based technologies, dip stick, pin or chip technologies, PCR, ELISAs or other 
multiple sample format technologies. 
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The subject invention also provides polynucleotides identified as control elements 
or regulatory sequences, such as gene promoters, enhancers, introns and untranslated regions 
which interact with cellular components to carry out regulatory functions such as replication, 
transcription, and translation. The invention further comprises the use of the disclosed 
5 polynucleotide sequences, or fragments thereof, in assays to characterize and/or identify 

sequences having promoter or other regulatory activity. Also contemplated according to the 
subject invention is the use of oligomers from these sequences in kits which can be used to 
identify promoters or other regulatory sequences. 

As used herein, the following definitions apply: 

10 An "oligonucleotide" or "oligomer" is a stretch of nucleotide residues which has a 

sufficient number of bases to be used in a polymerase chain reaction (PCR). These short 
sequences are based on (or designed from) genomic or cDNA sequences and are used to 
amplify, confirm, or reveal the presence of an identical, similar or complementary DNA or 
RNA in a particular cell or tissue. Oligonucleotides or oligomers comprise portions of a 

15 DNA sequence having at least about 10 nucleotides and as many as about 50 nucleotides, 

preferably about 1 5 to 30 nucleotides. They can be chemically synthesized and may be used 
as probes. 

"Probes" are nucleic acid sequences of variable length, preferably between at least 
about 10 and as many as about 6,000 nucleotides, depending on use. They are used in the 
20 detection of identical, similar, or complementary nucleic acid sequences. Longer length 

probes are usually obtained from a natural or recombinant source, are highly specific and 
much slower to hybridize than oligomers. They may be single- or double-stranded and 
designed to have specificity in PCR, hybridization membrane-based, or ELISA-like 
technologies. 

25 "Reporter" molecules are chemical moieties used for labeling a nucleic or amino acid 

sequence. They include, but are not limited to, radionuclides, enzymes, fluorescent, 
chemi-luminescent, or chromogenic agents. Reporter molecules associate with, establish the 
presence of, and may allow quantification of a particular nucleic or amino acid sequence. 



G: SI I-AIM'S VY-22 1 C 1 XC I wpd DNR la 



51 UF-221C1XC1 

A "portion" or "fragment" of a polynucleotide or nucleic acid comprises all or any 
part of the nucleotide sequence having fewer nucleotides than about 6 kb, preferably fewer 
than about 1 kb which can be used as a probe. Such probes may be labeled with reporter 
molecules using nick translation, Klenow fill-in reaction, PCR or other methods well known 
5 in the art. After pretesting to optimize reaction conditions and to eliminate false positives, 

nucleic acid probes may be used in Southern, northern or in situ hybridizations to determine 
whether target DNA or RNA is present in a biological sample, cell type, tissue, organ or 
organism. 

"Recombinant nucleotide variants" are alternate polynucleotides which encode a 

10 particular protein. They may be synthesized, for example, by making use of the 

"redundancy" in the genetic code. Various codon substitutions, such as the silent changes 
which produce specific restriction sites or codon usage-specific mutations, may be 
introduced to optimize cloning into a plasmid or viral vector or expression in a particular 
prokaryotic or eukaryotic host system, respectively. 

15 "Linkers" are synthesized palindromic nucleotide sequences which create internal 

restriction endonuclease sites for ease of cloning the genetic material of choice into various 
vectors. "Polylinkers" are engineered to include multiple restriction enzyme sites and 
provide for the use of both those enzymes which leave 5' and 3' overhangs such as BamHI, 
EcoRI, PstI, Kpnl and Hind III or which provide a blunt end such as EcoRV, SnaBI and Stul. 

20 "Control elements" or "regulatory sequences" are regions of the gene or DNA such 

as enhancers, promoters, introns and 3' untranslated regions which interact with cellular 
proteins to carry out replication, transcription, and translation. Typically, these regions are 
nontranslated. They may occur as boundary sequences or even split the gene. They function 
at the molecular level and along with regulatory genes are very important in development, 

25 growth, differentiation and aging processes. 

"Chimeric" molecules are polynucleotides or polypeptides which are created by 
combining one or more nucleotide peptide sequences (or their parts). In the case of 
nucleotide sequences, such combined sequences may be introduced into an appropriate 
vector and expressed to give rise to a chimeric polypeptide which may be expected to be 
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different from the native molecule in one or more of the following characteristics: cellular 
location, distribution, ligand-binding affinities, interchain affinities, degradation/turnover 
rate, signaling, etc. 

"Active" is that state which is capable of being useful or of carrying out some role. 
5 It specifically refers to those forms, fragments, or domains of an amino acid sequence which 

display the biologic and/or immunogenic activity characteristic of the naturally occurring 
peptide, polypeptide, or protein. 

"Naturally occurring" refers to a polypeptide produced by cells which have not been 
genetically engineered or which have been genetically engineered to produce the same 
10 sequence as that naturally produced. 

"Derivative" refers to those polypeptides which have been chemically modified by 
such techniques as ubiquitination, labeling, pegylation (derivatization with polyethylene 
glycol), and chemical insertion or substitution of amino acids such as ornithine which do not 
normally occur in proteins. 
15 "Recombinant polypeptide variant" refers to any polypeptide which differs from 

natural ly occurring peptide, polypeptide, or protein by amino acid insertions, deletions and/or 
substitutions. 

Amino acid "substitutions" are defined as one for one amino acid replacements. They 
are conservative in nature when the substituted amino acid has similar structural and/or 
20 chemical properties. Examples of conservative replacements are substitution of a leucine 

with an isoleucine or valine, an aspartate with a glutamate, or a threonine with a serine. 

Amino acid "insertions" or "deletions" are changes to or within an amino acid 
sequence. They typically fall in the range of about 1 to 5 amino acids. The variation allowed 
in a particular amino acid sequence may be experimentally determined by producing the 
25 peptide synthetically or by systematically making insertions, deletions, or substitutions of 

nucleotides in the sequence using recombinant DNA techniques. 

A "signal or leader sequence" is a short amino acid sequence which can be used, 
w hen desired, to direct the polypeptide through a membrane of a cell. Such a sequence may 
be naturally present on the polypeptides of the present invention or provided from 
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heterologous sources by recombinant DNA techniques. Such sequences include nuclear 
localization sequences (NLS) known in the art. 

An "oligopeptide" is a short stretch of amino acid residues and may be expressed 
from an oligonucleotide. Such sequences comprise a stretch of amino acid residues of at 
5 least about 5 amino acids and often about 17 or more amino acids, typically at least about 

9 to 1 3 amino acids, and of sufficient length to display biologic and/or immunogenic activity. 

An "inhibitor" is a substance which retards or prevents a chemical or physiological 
reaction or response. Common inhibitors include but are not limited to antisense molecules, 
antibodies, antagonists and their derivatives. 
10 A "standard" is a quantitativeor qualitative measurement for comparison. Preferably, 

it is based on a statistically appropriate number of samples and is created to use as a basis of 
comparison when performing diagnostic assays, running clinical trials, or following patient 
treatment profiles. The samples of a particular standard may be normal or similarly 
abnormal. 

15 Since the list of technical and scientific terms cannot be all encompassing, any 

undefined terms shall be construed to have the same meaning as is commonly understood 
by one of skill in the art to which this invention belongs. Furthermore, the singular forms 
"a", "an" and "the" include plural referents unless the context clearly dictates otherwise. 

The invention is not to be limited only to the particular sequences, variants, 

20 formulations or methods described. The sequences, variants, formulations and 

methodologies may vary, and the terminology used herein is for the purpose of describing 
particular embodiments. The terminology and definitions are not intended to be limiting. 

Polynucleotide probes . DNA possesses a fundamental property called base 
complementarity. In nature, DNA ordinarily exists in the form of pairs of anti-parallel 

25 strands, the bases on each strand projecting from that strand toward the opposite strand. The 

base adenine (A) on one strand will always be opposed to the base thymine (T) on the other 
strand, and the base guanine (G) will be opposed to the base cytosine (C). The bases are held 
in apposition by their ability to hydrogen bond in this specific way. Though each individual 
bond is relatively weak, the net effect of many adjacent hydrogen bonded bases, together 
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with base stacking effects, is a stable joining of the two complementary strands. These 
bonds can be broken by treatments such as high pH or high temperature, and these conditions 
result in the dissociation, or "denaturation," of the two strands. If the DNA is then placed 
in conditions which make hydrogen bonding of the bases thermodynamically favorable, the 
5 DNA strands will anneal, or "hybridize," and reform the original double- stranded DNA. If 

carried out under appropriate conditions, this hybridization can be highly specific. That is, 
only strands with a high degree of base complementarity will be able to form stable double- 
stranded structures. The relationship of the specificity of hybridizationto reaction conditions 
is well known. Thus, hybridization may be used to test whether two pieces of DNA are 
10 complementary in their base sequences. It is this hybridization mechanism which facilitates 

the use of probes of the subject invention to readily detect and characterize DNA sequences 
of interest. 

The polynucleotides of the subject invention can themselves be used as probes. 
Additional polynucleotide sequences can be added to the ends of (or internally in) the 

15 exemplified polynucleotide sequences so that polynucleotides that are longer than the 

exemplified polynucleotides can also be used as probes. Thus, isolated polynucleotides 
comprising one or more of the exemplified sequences are within the scope of the subject 
invention. Polynucleotides that have less nucleotides than the exemplified polynucleotides 
can also be used and are contemplated within the scope of the present invention. For 

20 example, for some purposes, it might be useful to use a conserved sequence from an 

exemplified polynucleotide wherein the conserved sequence comprises a portion of an 
exemplified sequence. Thus, polynucleotides of the subject invention can be used to find 
additional, homologous (wholly or partially) genes. Hybridization probes of the subject 
invention may be derived from the open reading frames specifically exemplified in the 

25 sequence listing, figures, and tables as well as from surrounding or included genomic 

sequences comprising untranslated regions such as promoters, enhancers and introns. 

Probes of the subject invention may be composed of DNA, RNA, or PNA (peptide 
nucleic acid). The probe will normally have at least about 10 bases, more usually at least 
about 1 7 bases, and may have up to about 1 00 bases or more. Longer probes can readily be 
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utilized, and such probes can be, for example, several kilobases in length. The probe 
sequence is designed to be at least substantially complementary to a portion of a gene 
encoding a protein of interest. The probe need not have perfect complementarity to the 
sequence to which it hybridizes. The probes may be labeled utilizing techniques that are well 
5 known to those skilled in this art. 

One approach for the use of the subject invention as probes entails first identify ing 
DNA segments that are homologous with the disclosed nucleotide sequences using, for 
example, Southern blot analysis of a gene bank. Thus, it is possible, without the aid of 
biological analysis, to know in advance the probable activity of many new polynucleotides, 

10 and of the individual gene products expressed by a given polynucleotide. Such an analysis 

provides a rapid method for identifying commercially valuable compositions. 

One hybridization procedure useful according to the subject invention typically 
includes the initial steps of isolating the DNA sample of interest and purifying it chemically. 
Either lysed cells or total fractionated nucleic acid isolated from cells can be used. Cells can 

15 be treated using known techniques to liberate their DNA (and/or RNA). The DNA sample 

can be cut into pieces with an appropriate restriction enzyme. The pieces can be separated 
by size through electrophoresisin a gel, usually agarose or acrylamide. The pieces of interest 
can be transferred to an immobilizing membrane. 

The particular hybridization technique is not essential to the subject invention. As 

20 improvements are made in hybridization techniques, they can be readily applied. 

The probe and sample can then be combined in a hybridization buffer solution and 
held at an appropriate temperature until annealing occurs. Thereafter, the membrane is 
washed free of extraneous materials, leaving the sample and bound probe molecules typically 
detected and quantified by autoradiography and/or liquid scintillation counting. As is well 

25 known in the art, if the probe molecule and nucleic acid sample hybridize by forming a 

strong non-covalentbond between the two molecules, it can be reasonably assumed that the 
probe and sample are essentially identical or very similar. The probe's detectable label 
provides a means for determining in a known manner whether hybridization has occurred. 
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In the use of the nucleotide segments as probes, the particular probe is labeled with 
any suitable label known to those skilled in the art, including radioactive and non-radioactive 
labels. Typical radioactive labels include 32 P, :>5 S, or the like. Non-radioactive labels 
include, for example, ligands such as biotin or thyroxine, as well as enzymes such as 
5 hydrolases or peroxidases, or the various chemiluminescers such as luciferin, or fluorescent 

compounds like fluorescein and its derivatives. In addition, the probes can be made 
inherently fluorescent as described in International Application No. WO 93/16094. 

Various degrees of stringency of hybridization can be employed. The more stringent 
the conditions, the greater the complementarity that is required for duplex formation. 
10 Stringency can be controlled by temperature, probe concentration, probe length, ionic 

strength, time, and the like. Preferably, hybridization is conducted under moderate to high 
stringency conditions by techniques well known in the art, as described, for example, in 
Keller, G.H., M.M. Manak (1987) DNA Probes, Stockton Press, New York, NY., pp. 169- 
170. 

15 As used herein "moderate to high stringency" conditions for hybridization refers to 

conditions that achieve the same, or about the same, degree of specificity of hybridization 
as the conditions "as described herein." Examples of moderate to high stringency conditions 
are provided herein. Specifically,hybridizationof immobilized DNA on Southern blots with 
32 P-labeled gene-specific probes was performed using standard methods (Maniatis et al). 

20 In general, hybridization and subsequent washes were carried out under moderate to high 

stringency conditions that allowed for detection of target sequences with homology to 
sequences exemplified herein. For double-stranded DNA gene probes, hybridization was 
carried out overnight at 20-25° C below the melting temperature (Tm) of the DNA hybrid 
in 6X SSPE, 5X Denhardt's solution, 0.1% SDS, 0.1 mg/ml denatured DNA. The melting 

25 temperature is described by the following formula from Beltz et al. (1983): 

Tm - 81.5°C + 16.6 Log[Na+]+0.41(% G+C)-0.61(% formamide)-600/length of 
duplex in base pairs. 

Washes are typically carried out as follows: 
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(1 ) Twice at room temperature for 15 minutes in IX SSPE, 0.1% SDS (low 
stringency wash). 

(2) Once at Tm-20°C for 15 minutes in 0.2X SSPE, 0.1% SDS (moderate 
stringency wash). 

5 For oligonucleotide probes, hybridization was carried out overnight at 10-20°C 

below the melting temperature (Tm) of the hybrid in 6X SSPE, 5X Denhardt's solution, 
0.1% SDS, 0.1 mg/ml denatured DNA. Tm for oligonucleotide probes was determined by 
the following formula from Suggs et al (1981): 

10 Tm (°C)=2(number 11 A base pairs) +4(number G/C base pairs) 

Washes were typically carried out as follows: 

(1) Twice at room temperature for 15 minutes IX SSPE, 0.1% SDS (low 
stringency wash). 

(2) Once at the hybridization temperature for 15 minutes in IX SSPE, 0.1% SDS 
15 (moderate stringency wash). 

In general, salt and/or temperature can be altered to change stringency. With a 
labeled DNA fragment of greater than about 70 or so bases in length, the following 
conditions can be used: 

Low: 1 or 2X SSPE, room temperature 

20 Low: 1 or2XSSPE,42°C 

Moderate: 0.2X or IX SSPE, 65 °C 
High: 0.1XSSPE,65°C. 
Duplex formation and stability depend on substantial complementarity between the 
two strands of a hybrid, and, as noted above, a certain degree of mismatch can be tolerated. 
25 Therefore, polynucleotide sequences of the subject invention include mutations (both single 

and multiple), deletions, and insertions in the described sequences, and combinations thereof, 
wherein said mutations, insertions, and deletions permit formation of stable hybrids with a 
target polynucleotideof interest. Mutations, insertions, and deletions can be produced in a 
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given polynucleotide sequence using standard methods known in the art. Other methods may 
become known in the future. 

The mutational, insertional, and deletional variants of the polypeptide sequences of 
the invention can be used in the same manner as the exemplified polynucleotide sequences 
5 so long as the variants have substantial sequence similarity with the original sequence. As 

used herein, substantial sequence similarity refers to the extent of nucleotide similarity that 
is sufficient to enable the variant polynucleotide to function in the same capacity as the 
original sequence. Preferably, this similarity is greater than 50%; more preferably, this 
similarity is greater than 75%; and most preferably, this similarity is greater than 90%. The 
10 degree of similarity needed for the variant to function in its intended capacity will depend 

upon the intended use of the sequence. It is well within the skill of a person trained in this 
art to make mutational, insertional, and deletional mutations that are designed to improve the 
function of the sequence or otherwise provide a methodological advantage. 

In one embodiment, the genes of the subject invention have at least one of the 
15 following characteristics: 

said gene is encoded by a nucleotide sequence which hybridizes under stringent 
conditions with a nucleotide sequence selected from the group consisting of: DNA 
which encodes SEQ ID NO: 1, DNA which encodes SEQ ID NO: 2, DNA which 
encodes SEQ ID NO: 3, DNA which encodes SEQ ID NO: 4, DNA which encodes 
20 SEQ ID NO: 5 or SEQ ID NO: 6, DNA which encodes SEQ ID NO: 7 or SEQ ID 

NO: 8. DNA which encodes SEQ ID NO: 9, DNA which encodes SEQ ID NO: 10, 
and DNA which encodes SEQ ID NO: 1 1 . 

The subject invention also includes polynucleotides that hybridize with other 
polynucleotides of the subject invention. 
25 PCR technology . Polymerase Chain Reaction (PCR) is a repetitive, enzymatic, 

primed synthesis of a nucleic acid sequence. This procedure is well known and commonly 
used by those skilled in this art (see U.S. Patent Nos. 4,683, 1 95, 4,683,202, and 4,800, 1 59; 
Saiki et al, 1985). PCR is based on the enzymatic amplification of a DNA fragment of 
interest that is Hanked by two oligonucleotide primers that hybridize to opposite strands of 
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the target sequence. The primers are oriented with the 3' ends pointing towards each other. 
Repeated cycles of heat denaturation of the template, annealing of the primers to their 
complementary sequences, and extension of the annealed primers with a DN A polymerase 
result in the amplification of the segment defined by the 5' ends of the PCR primers. Since 
5 the extension product of each primer can serve as a template for the other primer, each cycle 

essentially doubles the amount of DNA fragment produced in the previous cycle. This 
results in the exponential accumulation of the specific target fragment, up to several million- 
fold in a few hours. By using a thermostable DNA polymerase such as Taq polymerase, 
which is isolated from the thermophilic bacterium Thermits aquaticus* the amplification 
10 process can be completely automated. Other enzymes that can be used are known to those 

skilled in the art. 

The polynucleotide sequences of the subject invention (and portions thereof such as 
conserved regions and portions that serve to distinguish these sequences from previously- 
known sequences) can be used as, and/or used in the design of, primers for PCR 

15 amplification. In performing PCR amplification, a certain degree of mismatch can be 

tolerated between primer and template. Therefore, mutations, deletions, and insertions 
(especially additions of nucleotides to the 5' end) of the exemplified ppolynucleotides can 
be used in this manner. Mutations, insertions and deletions can be produced in a given 
primer by methods known to an ordinarily skilled artisan. 

20 Full length genes may be cloned utilizing partial nucleotide sequence and various 

methods known in the art. Gobinda et al (1993; PCR Methods Applic 2:318-22) disclose 
"restriction-site PCR" as a direct method which uses universal primers to retrieve unknown 
sequence adjacent to a known locus. First, genomic DNA is amplified in the presence of 
primer to linker and a primer specific to the known region. The amplified sequences are 

25 subjected to a second round of PCR with the same linker primer and another specific primer 

internal to the first one. Products of each round of PCR are transcribed with an appropriate 
RNA polymerase and sequenced using reverse transcriptase. 

Inverse PCR can be used to acquire unknown sequences starting with primers based 
on a known region (Triglia T. et al. (1988) Nucleic Acids Res 16:81 86). The method uses 
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several restriction enzymes to generate a suitable fragment in the known region of a gene. 
The fragment is then circularized by intramolecular ligation and used as a PCR template. 
Divergent primers are designed from the known region. The multiple rounds of restriction 
enzyme digestions and ligations that are necessary prior to PCR make the procedure slow 
5 and expensive (Gobinda et al [1993] supra). 

Capture PCR (Lagerstrom M. et al (1991) PCR Methods Applic 1:111-19) is a 
method for PCR amplification of DNA fragments adjacent to a known sequence in 
eucaryotic and YAC DNA. As noted by Gobinda et al (\993jupra), capture PCR also 
requires multiple restriction enzyme digestions and ligations to place an engineered 

10 double-stranded sequence into an unknown portion of the DNA molecule before PCR. 

Although the restriction and ligation reactions are carried out simultaneously, the 
requirements for extension, immobilization and two rounds of PCR and purification prior to 
sequencing render the method cumbersome and time consuming. 

Parker J. D. et al (Nucleic Acids Res [1991] 19:3055-60), teach walking PCR, a 

15 method for targeted gene walking which permits retrieval of unknown sequences. 

PromoterFinder™is a kit available from Clontech Laboratories, Inc. (Palo Alto, CA) which 
uses PCR and primers derived from p53 to walk in genomic DNA. Nested primers and 
special PromoterFinder™ libraries are used to detect upstream sequences such as promoters 
and regulatory elements. This process avoids the need to screen libraries and is useful in 

20 finding intron/exon junctions. 

A new PCR method replaces methods which use labeled probes to screen plasmid 
libraries and allow one researcher to process only about 3-5 genes in 14-40 days. In the first 
step, which can be performed in about two days, any two of a plurality of primers are 
designed and synthesized based on a known partial sequence. In step 2, which takes about 

25 six to eight hours, the sequence is extended by PCR amplification of a selected library. Steps 

3 and 4, which take about one day, are purification of the amplified cDNA and its ligation 
into an appropriate vector. Step 5, which takes about one day, involves transforming and 
growing up host bacteria. In step 6, which takes approximately five hours, PCR is used to 
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screen bacterial clones for extended sequence. The final steps, which take about one day. 
involve the preparation and sequencing of selected clones. 

If the full length cDNA has not been obtained, the entire procedure is repeated using 
either the original library or some other preferred library. The preferred library may be one 
5 that has been size-selected to include only larger cDNAs or may consist of single or 

combined commerciallyavailabIelibraries,e.g\, from Clontech Laboratories, Inc. (Palo Alto, 
CA). The cDNA library may have been prepared with oligo (dT) or random priming. 
Random primed libraries are preferred in that they will contain more sequences which 
contain 5' ends of genes. A randomly primed library may be particularly useful if an oligo 
10 (dT) library does not yield a complete gene. It must be noted that the larger and more 

complex the protein, the less likely it is that the complete gene will be found in a single 
plasmid. 

CLONTECH PCR-Select™ cDNA Subtraction (Clontech Laboratories, Inc., Palo 
Alto, CA) is yet another means by which differentially expressed genes may be isolated. The 

15 procedure allows for the isolation of transcripts present in one mRNA population which is 

absent, or found in reduced numbers, in a second population of mRNA. Rare transcripts may 
be enriched 1000-fold. 

A new method for analyzing either the size or the nucleotide sequence of PCR 
products is capillary electrophoresis. Systems for rapid sequencing are available from Perkin 

20 Elmer (Foster City CA), Beckman Instruments (Fullerton, CA), and other companies. 

Capillary sequencing employs flowable polymers for electrophoretic separation, four 
different fluorescent dyes (one for each nucleotide) which are laser activated, and detection 
of the emitted wavelengths by a charge coupled devise camera. Output/light intensity is 
converted to electrical signal using appropriate software (eg. Genotyper™ and Sequence 

25 Navigators™ from Perkin Elmer) and the entire process from loading of samples to computer 

analysis and electronic data display is computer controlled. Capillary electrophoresis 
provides greater resolution and is many times faster than standard gel based procedures. It 
is particularly suited to the sequencing of small pieces of DNA which might be present in 
limited amounts in a particular sample. The reproducible sequencing of up to 350 bp of Ml 3 
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phage DNA in 30 min has been reported (Ruiz-Martinez M. C. et al. [1993] Anal Chem 
65:2851-8). 

Polynucleotides and proteins . Polynucleotides of the subject invention can be 
defined according to several parameters. One characteristic is the biological activity of the 
5 protein products as identified herein. The proteins and genes of the subject invention can be 

further defined by their amino acid and nucleotide sequences. The sequences of the 
molecules can be defined in terms of homology to certain exemplified sequences as well as 
in terms of the ability to hybridize with, or be amplified by, certain exemplified probes and 
primers. Additional primers and probes can readily be constructed by those skilled in the art 

10 such that alternate polynucleotide sequences encoding the same amino acid sequences can 

be used to identify and/or characterize additional genes. The proteins of the subject 
invention can also be identified based on their immunoreactivity with certain antibodies. 

The polynucleotides and proteins or polypeptides of the subject invention include 
portions, fragments, variants, and mutants of the full-length sequences as well as fusions and 

15 chimerics, so long as the encoded protein retains the characteristic biological activity of the 

proteins identified herein. As used herein, the terms ^variants" or "variations" of genes refer 
to nucleotide sequences that encode the same proteins or which encode equivalent proteins 
having equivalent biological activity. As used herein, the term "equivalent proteins" refers 
to proteins having the same or essentially the same biological activity as the exemplified 

20 proteins. 

Variations of genes may be readily constructed using standard techniques such as 
site-directed mutagenesis and other methods of making point mutations and by DNA 
shuffling, for example. In addition, gene and protein fragments can be made using 
commercially available exonucleases, endonucleases, and proteases according to standard 
25 procedures. For example, enzymes such as BaB\ can be used to systematically cut off 

nucleotides from the ends of genes. Also, genes that encode fragments may be obtained 
using a variety of restriction enzymes. Proteases may be used to directly obtain active 
fragments of these proteins. Of course, molecular techniques for cloning polynucleotides 
and producing gene constructs of interest are also well known in the art. In vitro evaluation 
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techniques, such as MAXYGEN's "Molecular Breeding" can also be applied to practice the 
subject invention. 

Because of the redundancy of the genetic code, a variety of different DNA sequences 
can encode the amino acid sequences encoded by the polynucleotide sequences disclosed 
5 herein. It is well within the skill of a person trained in the art to create these alternative DNA 

sequences encoding proteins having the same, or essentially the same, amino acid sequence. 
These variant DNA sequences are within the scope of the subject invention. As used herein, 
reference to "essentially the same" sequence refers to sequences that have amino acid 
substitutions, deletions, additions, or insertions that do not materially affect biological 
10 activity. Fragments retaining the characteristic biological activity are also included in this 

definition. 

A further method for identifying genes and polynucleotides(and the proteins encoded 
thereby) of the subject invention is through the use of oligonucleotide probes. Probes 
provide a rapid method for identifying genes of the subject invention. The nucleotide 

1 5 segments that are used as probes according to the invention can be synthesized using a DNA 

synthesizer and standard procedures. 

The subject invention comprises variant or equivalent proteins (and nucleotide 
sequences coding for equivalent proteins) having the same or similar biological activity of 
proteins encoded by the exemplified polynucleotides. Equivalent proteins will have amino 

20 acid similarity with an exemplified protein (or peptide). The amino acid identity will 

typically be greater than 60%. Preferably, the amino acid identity will be greater than 75%. 
More preferably, the amino acid identity will be greater than 80%, and even more preferably 
greater than 90%. Most preferably, amino acid identity will be greater than 95%. (Likewise, 
the polynucleotides that encode the subject polypeptides will also have corresponding 

25 identities in these preferred ranges.) These identities are as determined using standard 

alignment techniques for determining amino acid identity. The amino acid 
identity/similarity/ homology will be highest in critical regions of the protein including those 
regions that account for biological activity or that are involved in the determination of three- 
dimensional configuration that is ultimately responsible for the biological activity. In this 
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regard, certain amino acid substitutions are acceptable and can be expected if these 
substitutions are in regions which are not critical to activity or are conservative amino acid 
substitutions which do not affect the three-dimensional configuration of the molecule. For 
example, amino acids may be placed in the following classes: non-polar, uncharged polar, 
5 basic, and acidic. Conservative substitutions whereby an amino acid of one class is replaced 

with another amino acid of the same type fall within the scope of the subject invention so 
long as the substitution does not materially alter the biological activity of the compound. 
Table 4 provides a listing of examples of amino acids belonging to each class. 



Table 4. 



Class of Amino Acid 


Examples of Amino Acids 


Nonpolar 


Ala, Val, Leu, He, Pro, Met, Phe, Trp 


Uncharged Polar 


Gly, Ser, Thr, Cys, Tyr, Asn, Gin 


Acidic 


Asp, Glu 


Basic 


Lys, Arg, His 



15 

In some instances, non-conservative substitutions can also be made. The critical 
factor is that these substitutions must not significantly detract from the biological activity of 
the polypeptide. 

An "isolated" or "substantially pure" nucleic acid molecule or polynucleotide is a 
20 polynucleotide that is substantially separated from other polynucleotide sequences which 

naturally accompany a nucleic acid molecule. The term embraces a polynucleotide sequence 
which was removed from its naturally occurring environment by the hand of man. This 
includes recombinant or cloned DNA isolates, chemically synthesized analogues and 
analogues biologically synthesized by heterologous systems. An ''isolated" or "purified" 
25 protein or polypeptide, likewise, is a one removed from its naturally occurring environment. 
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Materials and Methods 

Cells and virus. 

AmEPV (Hall, R. L. and R.W. Moyer [ 1 99 1 ] supra) was replicated in IPLB-LD-652 
cells (Goodwin, R. H., J. R. Adams and M. Shapiro [1990] "Replication of the 
5 entomopoxvirus from Amsacta moorei in serum-free cultures of a gypsy moth cell line" J. 

Invertebr. Pathol 56: 190-205) which were maintained at 28°C in a 1:1 mixed medium (TE 
medium) of TC-100 media (Gibco, Gaithersburg, MD) and EX-CELL 401 media (JRH 
Biosciences, Lenexa, KS), supplemented with 10% fetal bovine serum. A TK negative cell 
line designated CI 1.3 was selected by a process of adaption of TK(+) LD652 cell to 
10 increasing levels, 10 ^g/ml every 5 weeks, of 5-bromo-2 / -deoxyuridine (BudR) over one 

year up to 100 ,ug/ml BudR and maintained in TE medium containing BudR (100 ^g/ml). 
293 cells were grown in DMEM medium supplemented with 5% fatal bovine serum. 



Plasmid construction and preparation of AmEPV recombinant . 

15 pTR-UF5 (see Figure 1, provided by the Vector Core, Gene Therapy Center, 

University of Florida) contains GFP and NeoR genes under control CMV promoter and 
herpes virus TK promoter respectively and flanked by ITR sequences of AAV. The Pst I 
fragment which contains GFP and NeoR markers was inserted into Pst I site of pTKDU (Li, 
Y., R.L. Hall, S.L. Yuan, R.W. Moyer [1998] "High level expression of Amsacta moorei 

20 entomopoxvirus Spheroidin depends on sequences within the gene" ./. Gen. Virol. 79:613- 

622) to produce pTKUF5. AmEPV recombinant with an insert in the TK gene was obtained 
as described previously (Li et al [1998] supra). 



Viral genomic DNA preparation 
25 Growth and maintenance protocols for IPLB-LD-652 cells and AmEPV are described 

in detail in Bawden et al, 2000. DNA was obtained from amplification of a single 
wtAmEPV plaque (Bawden, A. L., Li, Y., Maggard, K., and Moyer, R. W. [2000] 
Entomopoxvirus Vectors. In "Viral Vectors: Basic Science and Gene Therapy," A. Cid- 
Arregui, Ed., Eaton Publishing, Natick, MA (In Press)). Thirty 150 cm 2 dishes containing 



G: SI 1- API'S' IJ J -22 1 C I XC 1 wpd DNB. la 



66 UF-221C1XC1 

approximately 2.4 x 1 0 7 LD652 cells (Goodwin, R. H., Adams, J. R., and Shapiro, M. [ 1 990] 
"Replication of the Entomopoxvirus from Amsacta moorei in Serum-Free Cultures of a 
Gypsy Moth Cell Line'V. Invertebr. Path 56:190-205) were infected at an MOI of 0.01 with 
wild type AmEPV and incubated at 28°C. Infections (cells and medium) were harvested 6 
5 days post-infection and centrifuged at 500 x g for 15 minutes to remove cells. The 

supernatant was centrifuged at 40,000 x g for 30 minutes to pellet virus. The pellet was 
resuspended in dH 2 0 (1 00 \iL for each initial 30 mL of supernatant). DNase free RNase was 
added to a final concentration of 50 |ag/mL and incubated at 37 °C for 30 min. The sample 
and lysis buffer (lOOmM Tris pH 8.0, 10 mM EDTA, 54% sucrose, 2% SDS, lOmM 0- 
10 mercaptoethanol) were brought to 50°C, and lysis buffer was added to the sample at a 1:1 

ratio. Proteinase K was added to a final concentration of 0.6 mg/mL. The viral lysate was 
incubated overnight at 50°C. The lysate was extracted three times with 50:49:1 
phenol:chloroform:isoamylalcohol, once with chloroform, and the DNA precipitated in 0.4 
M LiCk 95% ethanol 

15 

Tsp509\ partial digest library preparation 

Ten micrograms of AmEPV DNA were digested with 5 units of 71sy?509L Two 
aliquots were removed at 3 and 6 minute time points and digestion stopped with 50 mM 
EDTA (final concentration). This method was repeated in triplicate for a total of 30 jag of 

20 digested DNA. Fragments of 2-3 kb and 4-5 kb were gel-purified separately with the Gene- 

Clean II kit (Bio 1 0 1 , Vista, C A) and ligated into the EcoRI site of the PUC 1 9 plasmid vector 
(Amersham Pharmacia Biotech UK Ltd., Chalfont, Buckinghamshire, England). The 
ligation mixture was transformed into DH5-cc competent cells and plated onto LB agar 
plates containing 50 |ig/mL ampicillin and 800 (ag/plate each IPTG and X-gal (Horton, P. 

25 and Nakai, K. [1997] ''Better prediction of protein cellular localization sites with the k 

nearest neighbors classified Ismb. 5:147-152). White colonies were isolated and grown 
overnight in 1 mL TB medium (Horton & Nakai [1997] supra) plus 50 |ig/mL ampicillin. 
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Sequence determination 

Plasmid DN A was prepared using the QIAgen BioRobot 9600 and the QI Aprep 96 
Turbo miniprep kit. Sequencing was performed with 200-500ng of plasmid DNA as 
template using a 0.25X concentration of ABI Prism BigDye Terminator Cycle Sequencing 
5 Ready Reaction kit (#43031 53; Perkin-Elmer Applied Biosystems [ABI], Foster City, CA). 

Cycle sequencing was performed using a PTC-200 DNA Engine (MJ Research, Watertown, 
MA) (25 cycles: 1 degree per second to 96 degrees; 96 for 10 seconds; 1 degree per second 
to 60 degrees; 60 for 4 minutes). Dye terminator removal was on Multiscreen-HV plates 
(Millipore) with Sephadex G-50 superfine (Sigma, St. Louis, MO) in water. The reactions 
10 were electrophoresed on an ABI 377 sequencer, and the chromatograms were edited with 

Analysis version 1.2.1 (ABI) and assembled as follows. 

Sequence Assembly and Analysis 

Chromatograms were assembled into "contigs" using the Phred/Phrap/Consed 

15 software package (Horton, P. and Nakai, K. [1996] "A probabilistic classification system for 

predicting the cellular localization sites of proteins" Ismb. 4:109-115; Altschul, S. F., 
Madden, T. L., Schaffer, A. A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D. J. [1997] 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs" 
Nucleic Acids Res. 25:3389-3402; Bateman, A., Birney, E., Durbin, R., Eddy, S. R., Finn, 

20 R. D., and Sonnhammer, E. L. [1999] "Pfam 3.1: 1313 multiple alignments and profile 

HMMs match the majority of proteins" Nucleic Acids Res. 27:260-262). After assembling 
3500 chromatograms into 6 contigs, Consed designed 43 finishing experiments. Custom 
oligonucleotide primers were synthesized by Integrated DNA Technologies (Coralville, I A), 
and upon completion of the experiments, the assembly contained the entire unique region of 

25 the genome and one inverted terminal repeat (ITR). After further data manipulation using 

the programs phrapview and miropeats, the ITR regions on either end were delineated 
(Horton & Nakai [1996] supra; Altschul, S. F., Gish, W., Miller, W., Myers, E. W., and 
Lipman, D. J. [1990] "Basic local alignment search tool" J. Mol Biol 215:403-410). The 
consensus was sequenced to an average redundancy of 1 OX. The sequence was confirmed 



(i:\SH-APPS-UF-22 1 C 1 XC 1 wpd DNB la 



68 UF-221C1XC1 

by comparisionto BamW\, EcoKl, Hind[\\,Pst\, and Xhoh restriction maps of AmEPV (Hall, 
R. L. and Flink, W. F. [1990] "Physical mapping and field inversion gel electrophoresis of 
Amsacta moorei entomopoxvirusDNA" Arch. Virol. 110:77-90). The sequence of AmEPV 
has been deposited into GenBank, nucleotide acession number AF250284. Methionine- 
5 initiated open reading frames were delineated using Vector NTI. Open reading frames that 

translated into proteins less than 60 amino acids were discarded from our analysis. Relevant 
homologies were determined by BLAST analysis (Parsons, J. D. [1995] "Miropeats: 
graphical DNA sequence comparisons" Comput. Appl. BioscL 11:615-619; Ewing, B., 
Hillier, L., WendfM. Cand Green, P. [1998] "Base-calling of automated sequencer traces 

10 using phred. I. Accuracy assessment" Genome Res. 8:175-185), and additional domains 

found using the Pfam program (Gordon, D., Abajian, C, and Green, P. [1998] "Consed: a 
graphical tool for sequence finishing 'Genome Res. 8:195-202). Default E (EXPECT) values 
of <0.01 were used to define homology to sequences in current databases. Transmembrane 
( TM), leucine zipper, and signal peptide (SP) domains were predicted by the Psort program 

15 (van Veen and Konings [1998] supra: Ewing, B. and Green, P. [1998] "Base-calling of 

automated sequencer traces using phred. II. Error probabilities" Genome Res. 8:1 86-1 94). 

Following are examples which illustrate procedures for practicing the invention. 
These examples should not be construed as limiting. All percentages are by weight and all 
solvent mixture proportions are by volume unless otherwise noted. 

20 

Example 1 — Gene expression in cells infected with recombinant AmEPV 

293 cells (1 x 10 6 ) were placed in 6-well plate and infected with recombinant 
AmEPVpTKUF5 or AmEPVpTKespgfp(Li etal. [1997] supra) viruses at a multiplicity of 
five (5) virus particles/cell. As controls, cells were separately transfected with either the 
25 plasmid pTR-UF5 or pTKUFS at a 5 ^ug/well plasmid DNA. Two days later, virus infected 

or plasmid transfected cells were transferred into 60 mm dishes, after 24 hr, neomycin 
resistant colonies were selected by adding G418 at the final concentration of 200 jug/m\. 
G41 8 containing medium was changed every 3-4 days. 
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For cells infected with recombinant AmEPV pTKespgfp, no neomycin resistant 
colony was observed, an expected result since this virus does not have NeoR gene. 
However, in cells infected with recombinant AmEPV pTKUFS or transfected with plasmids 
pTR-UF5, G41 8 resistant colonies were observed. All colonies from cells transfected with 
5 either of the two plasmids were both G41 8 resistant and GFP positive. However, colonies 

from cells infected with recombinant pTKUFS were initially only G418 resistant, and not 
GFP positive. G418 resistant colonies derived from the AmEPV recombinant also grew 
more slowly than those produced following plasmid transfection. Most likely, the 
explanation for these results is that GFP and NeoR gene copy number in AmEPV derived 
10 colonies is less than those transfected with plasmids. This explanation is likely to be true as 

we were able to show that the AmEPV derived colonies gradually become more and more 
resistant to G4 1 8 and soon, some GFP positive clusters of cells were observed which become 
more numerous and brighter. After several changes of medium, ultimately, all cells in the 
well were GFP positive. 

15 

Example 2 — Stable integration of foreign DNA sequences into mammalian cells infected 
with recombinant AmEPV 

Genomic DNA was recovered from cell lines created by either infection with the 
virus AmEPVpTKUFS or following transfection with a control plasmid pTR-UF5. 

20 Specifically, the recombinant AmEPVpTKUFS was used to infect and subsequently select 

293 (human kidney) cells at a multiplicity of 5 plaque forming units per cell, as described 
in Example 1 . After growing the isolated cell lines reliably for multiple generations, DNA 
was isolated and digested with HindlW before electrophoresis and blotted with a random 
labeled probe containing the gfp and neo genes which are contained within the ITR regions 

25 of pTR-UF5. As shown in Figure 2, lane P contains genomic DNA from 293 cells and pTR- 

UF5 plasmid, showing excision of the cassette from the plasmid upon digestion. A control 
(not shown) of 293 cells alone did not produce any endogenous cross-reacting bands. As 
seen in Figure 2, the host chromosomal site in the 293 genome of integration is random, as 
evidenced by the different sized bands resulting from Hindlll digestion. In some cell lines. 
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the event can be seen to have occurred more than once (multiple copies have integrated). 
Directional integration into the long arm of chromosome 19 would be expected if the rep 
gene of AAV were simultaneously expressed. This experimental data proves delivery and 
stable integration of foreign DNA sequences by AmEPV. 

5 

Example 3 — Growth and amplification of AmEPV 

AmEPV productively infects Lymantria J/s/?ar-derived IPLB-LD-652 (LD) cells 
(Goodwin et al, 1978). LD cells were maintained at 28°C in a 1:1 ratio of TC- 100 medium 
(Gibco, Gaithersburg, MD) to EX-CELL 401 medium (JRH Biosciences, Lenexa, KN) 

10 supplemented with 10% fetal bovine serum, 50 U/ml penicillin, and 50 |ig/ml streptomycin 

(1:1 TE). To amplify virus, cells grown in 150 mm dishes are inoculated at a multiplicity 
of infection (m.o.i.) of 0.01 in sufficient media (5 ml) to cover the surface of the tissue 
culture vessel. The cells should be no more than 70% confluent. After adsorption at 28°C 
for 2 hr, 25 ml medium is added and the infections are incubated for 4-6 days at 28°C. The 

15 infection is considered complete when most cells become occlusion body positive as seen 

by light microscopy, i.e. when refractile occlusion bodies can be seen. For recombinant 
viruses in which lacZ has been inserted into either the TK or the spheroidin locus, infection 
is monitored by in situ staining of infected cells with 1 mg/ml 5-Bromo-4-chloro-3-indolyl- 
P-D-galactopyranoside (Xgal),4 mM potassium ferricyanide, 4 mM potassium ferrocyanide, 

20 and 2 mM MgCL in phosphate buffered saline (PBS) (140 mM NaCl, 2.7 mM KC1, 10 mM 

Na 2 P0 4 , 1.8 mM KH 2 P0 4 , pH 7.4). This stain allows P-galactosidase producing infected 
cells to be visualized by the appearance of blue color. Though numbers can vary, an 
AmEPV infection at an m.o.i. of 0.01 generates a net yield of approximately 1.5 plaque 
forming units (PFU) based on the total number of cells and results in a 100-fold increase in 

25 PFU. Titers are routinely between 10 5 and 10 6 PFU/ml. Cells infected at a higher m.o.i. can 

generate higher yields/cell (-10) but the net increase in virus over input is lower. 
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Example 4 — Partial purification and concentration of virus 

Some experiments necessitate higher concentrations of virus. The procedure that 
follows typically renders semi-pure virus stocks at titers > 10 8 PFU/mL AmEPV infected 
cells are harvested by scraping and centrifuged at 700 x g for 15 min to remove cells. The 
5 supernatant is then subjected to ultracentrifugation at 70,000 x g for 2 hr to pellet the 

extracellular virus. The virus is resuspended in an appropriate amount of PBS (typically 1 00 
[il per 40 ml of supernatant) and the titer is determined by plaque assay. Total yield is 
typically 50% of input virus. 

10 Example 5 — Plaque Assay 

Virus to be titered is subjected to 1 0-fold serial dilutions in 1 : 1 TE medium. LD-652 
cells are plated at 70% confluency in 6-well dishes, each having a 34.6 mm diameter 
(roughly 1 A x 1 0 6 cells per well). Once the cells have adhered, the medium is removed and 
0.5 ml of diluted virus is added to the wells. After adsorption at 28°C for 2 hr, the inoculum 

15 is removed and 2.5 ml of overlay is added to each well. The overlay is a 2: 1 ratio of 1 .33X 

TC- 1 00 medium (containing 1 4% fetal calf serum) and 4% sterile low melting point agarose, 
equilibrated to 42°C and mixed just prior to addition to the monolayer. 

For a spheroidin-positivevirus, visible plaques appear and are counted one week post 
infection (Figure 3). Spheroidin-negative virus plaques are much more difficult to visualize, 

20 hence most spheroidin negative viruses have been engineered to contain lacZ. Such viruses 

can be readily visualized by staining with Xgal as follows: a liquid overlay of 400 \ig Xgal 
in 50 |il total solution is spread over the agarose in each well of the plaque assay 3-4 days 
post-infection. Plaques appear as blue patches of infected cells and are counted one week 
post-infection. 

25 Comet-like plaques of wtAmEPV, in the absence of an agarose overlay can be more 

rapidly visualized by immunostaining (Winter, J., R.L. Hall, and R.W. Moyer [1995] k The 
effect of inhibitors on the growth of the entomopoxvirusfrom Amsacta moorei in Lymantria 
dispar (gypsy moth) cells" Virology 21 1 :462-473). Plaque assays are prepared as above, 
except that a liquid overlay of medium replaces the agarose overlay. Infected cell 
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monolayers are air dried three days postinfection and fixed in acid alcohol (95% EtOH, 5% 
glacial acetic acid) for 30 minutes. After equilibration with TBS (0.02 M Tris pH 7.4,0.15 
M NaCl) the cells are incubated in TBS-Block (0.5% w/v blocking reagent in TBS 
[Boehringer Mannheim, Germany]) for 1 hr at room temperature to prevent nonspecific 
5 antibody binding. The primary antibody (rabbit anti-AmEPV occlusion body antiserum) 

(Hall et al [1996] supra) or secondary antibody (goat anti-rabbit conjugated to alkaline 
phosphatase; Fisher, Atlanta, GA) are both diluted in TBS-Block. Antibody reactions and 
color development are performed as previously described (Harlow. E., and D. Lane [1998] 
Antibodies- A Laboratory Manual, E. Harlow and D. Lane, Eds., pp. 635-657. Cold Spring 
10 Harbor Laboratory, Cold Spring Harbor, NY). 



Example 6 — Isolation of AmEPV genomic DNA 

LD-652 cells (typically 10 9 cells, thirty 150 mm dishes) are infected with AmEPV 

15 at an m.o.i. of 0.01. The infections (cells and medium) are harvested by scraping and 

centrifuged at 700 x g for 1 5 min. to remove cells. The supernatant is centrifuged at 39,000 
x g for 30 min. to pellet extracellular virus. The viral pellet is resuspended in deionized 
water (100 |^1 for 40 ml of supernatant). DNAase free RNAase is added to the resuspended 
viral pellet at a final concentration of 50 |ig/ml and incubated at 37°C for 30 min. The virus 

20 sample is then heated to 50°C, and an equal volume of lysis buffer (100 mM TrispH 8.0, 10 

mM EDTA, 54% sucrose, 2% SDS, 10 mM P-mercaptoethanol) is added to the sample. 
Proteinase K is then added to a final concentration of 0.6 mg/ml, and the viral lysate is 
incubated overnight at 50°C. The lysate is extracted three times with 50:49:1 
phenol :chloroform:isoamylalcohol and once with chloroform, and the DNA is precipitated 

25 in 0.4 M LiCl, 95% ethanol. This procedure typically yields 2 |ig of genomic AmEPV DNA 

per 10 7 infected cells. 
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Example 7 — Shuttle vector plasmid construction 

Following the sequence determination of two non-essential genes, thymidine kinase 
( TK) and spheroidin (Gruidl et al. [ 1 992] supra; Hall and Moyer [1991] supra), we were able 
to create shuttle vector plasmids for the generation of recombinant AmEPV viruses. The 
5 shuttle vectors are described below. 

A. The TK insertion site shuttle vector : Oligonucleotideprimers were used to PCR 
amplify a 748-bp fragment of downstream TK flanking sequence from plasmid pMEGTK-1 
( Gruidl et al [ 1 992] supra). Another set of oligonucleotideprimers was used to PCR amplify 
a 663-bp fragment of TK upstream flanking sequence from pMEGTK-1. These two 

10 fragments were separately inserted into pBluescript I SK(+ ) to produce pDUTK (Li, Y., Hall, 

R. L., and Moyer, R. W. [1997] 'Transient, nonlethal expression of genes in vertebrate cells 
by recombinantentomopoxviruses'V. Virol. 71:9557-9562). Foreign genes were then cloned 
within the TK flanks to generate shuttle vectors for the generation of recombinants (Li, Y., 
Hall, R. L., and Moyer, R. W. [1997] "Transient, nonlethal expression of genes in vertebrate 

15 cells by recombinant entomopoxviruses" J. Virol. 71:9557-9562). 

B. The spheroidin insertion site shuttle vector : Oligonucleotides were used to PCR 
amplify a 1046-bp fragment of upstream spheroidin flanking sequence from plasmid 
pRH512 (Hall and Moyer [1991] supra). In addition to 1023-bp of sequence upstream of the 
spheroidin gene, this fragment contained the starting ATG of the spheroidin coding sequence 

20 and twenty base pairs following the ATG. Another set of oligonucleotide primers was used 

to amplify a 998-bp fragment of downstream spheroidin flanking sequence from pRH512. 
These two fragments were separately inserted into pBluescript I SK(+) to produce pDU20 
(Hall, R. L., Li, Y., Feller, J. A., and Moyer, R. W. [1996] 'The Amsacta moorei 
entomopoxvirus spheroidin gene is improperly transcribed in vertebrate poxviruses" 

25 Virology 224:427-436). Subsequent constructs were cloned within the spheroidin flanks to 

generate various shuttle vectors for the generation of recombinants (Hall et al. [1996] 
Virology 224:427-436). 

C. AmEPV early promoter constructs : Promoters for early poxvirus genes are active 
prior to viral DNA replication. We have utilized two early EPV promoters in our constructs. 
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The first, an AmEPV early strong promoter (esp) was derived from a strongly expressed 42 
kDa early protein (Li, Y., Hall, R. L„ and Moyer, R. W. [1997] "Transient, nonlethal 
expression of genes in vertebrate cells by recombinant entomopoxviruses'V. Virol. 71 :9557- 
9562). The second promoter was derived from the early expressed fusolin (fits) gene as 
5 described (Gauthier et al [1 995] supra). These promoters have been used to regulate reporter 

genes (lacZ, gfp). The appropriately regulated genes are then placed within shuttle vectors 
and transfected into infected cells to produce recombinant viruses. The shuttle vector pTK- 
fuslacZ was constructed by PCR amplification of the MmEPV fusolin early promoter from 
pHF5 1 and insertion into pDUTK; lacZ was subcloned from pMC 1871 (Pharmacia Biotech, 

10 Inc., Piscataway,N.J.) as described (Li, Y., Hall, R. L., and Moyer, R. W. [1997] "Transient, 

nonlethal expression of genes in vertebrate cells by recombinant entomopoxviruses'V. Virol 
7 1 :9557-9562). The shuttle vector pTK-esp/acZ (pTK-42k/acZ) was constructed by cloning 
the PCR amplified esp promoter into pTK-fus/acZ after excision of the fusolin promoter (Li, 
Y., Hall, R. L., and Moyer, R. W. [1997] "Transient, nonlethal expression of genes in 

15 vertebrate cells by recombinant entomopoxviruses" J. Virol 71 :9557-9562). To construct 

pTK-espg#>, a green fluorescent protein gene (gfp) was PCR-amplified from the pTR-UF5 
plasmid (Vector Core, University of Florida) (18) and cloned into pTK-esp/acZ replacing the 
espl&cZ cassette as described in (Li, Y., Hall, R. L., and Moyer, R. W. [1997] "Transient, 
nonlethal expression of genes in vertebrate cells by recombinant entomopoxviruses'V. Virol 

20 71:9557-9562). 

D. AmEPV late promoter constructs : We have used the spheroidin (sph) promoter 
as an example of an AmEPV strong late promoter. This promoter has two rather unexpected 
properties: (1 ) the sph promoter appears to be insect cell specific and functions very poorly 
in vertebrate cells (Hall et al [1996] supra), and (2) unlike previously described late 

25 poxvirus promoters, we have found in insect cells that expression is further enhanced by 

including the 20 bp downstream of the TAAATG promoter in the reporter gene constructs 
(23). pDU20/acZ was created by insertion of lacZ (from plasmid pMC1871, Pharmacia 
Biotech, Inc., Piscataway, NJ) into the Bam\A\ site of pDU20. The final reporter contains 
1046 bp of potential spheroidin promoter sequence plus 20 bp of additional downstream 
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spheroidin coding sequence following the TAAATG sequence before fusion to lacZ (Hall 
et al [1996] supra). pDV2lacZ was constructed using the same strategy as that for 
pDU20/acZ, except that only 2 bp of spheroidin coding sequence follows the translation- 
starting TAAATG before fusion to /ac-Z (Hall et al [1996] supra. Li, Y., R.L. Half S. Yuan, 
5 and R.W. Moyer [1998] "High-level expression of Amsacta moorei entomopoxvirus 

spheroidin depends on sequences within the gene" J. Gen. Virol. 79:61 3-622). We have also 
constructed and used the cowpoxvirus late ATI gene promoter to drive lacZ which functions 
well in both insect and vertebrate cells (Li, Y., R.L. Hall, S. Yuan, and R.W. Moyer [ 1 998] 
"High-level expression of Amsacta moorei entomopoxvirus spheroidin depends on 

10 sequences within the gene" J. Gen. Virol. 79:613-622). 

E. Construct driven by Pol II specific promoters : An AmEPV construct containing 
reporter genes driven by Pol II rather than poxvirus promoters has also been prepared based 
on the plasmid pTR-UF5 (Vector Core, University of Florida) (Klein, R. L., E. M. Meyer, 
A. L. Peel, S. Zolotukhin,C. Meyers, N. Muzyczka,and M.A. King [1998] "Neuron-specific 

15 transduction in the rat septohippocampal or nigrostriatal pathway by recombinant adeno- 

associated virus vectors" Experimental Neurology 150:183-194) which is a plasmid 
containing two mammalian reporter genes: ( 1 ) a gene coding for neomycin resistance (neoR) 
driven by the herpes simplex virus TK promoter and (2) gfp driven by the immediate early 
promoter/enhancer from cytomegalovirus (CM V). The dual gene cassette is flanked by the 

20 AAV-inverted terminal repeats (ITRs). pTR-UF5 was digested with Sail to remove two Pst\ 

sites then religated to form pTRUF5XS#/L This construct was then digested with PstV and 
the fragment containing the two reporter genes was inserted into the Pst\ site of pDUTK to 
produce pTKUF5)Sa//. This construct was then digested with Sail and the previously 
removed Sail fragment was reinserted into the construct to produce pTKUFS (Figure 1). 

25 

Example 8 — Generation and Selection of recombinant AmEPV 

Neither the spheroidin gene nor the thymidine kinase gene is required for propagation 
of AmEPV in cell culture (Palmer, CP., D.P. Miller, S.A. Marlow. L.E. Wilson, A.M. 
Lawrie,and L.A. King [1995] "Genetic modification of an entomopoxvirus: deletion of the 
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spheroidin gene does not affect virus replication in vitro'* J. Gen. Virol, 76:15-23; R.W. 
Moyer, Li, Y. and Bawden, A., unpublished results), providing sites for insertion of foreign 
genes by homologous recombination. Following transfection of AmEPV infected LD-652 
cells, all foreign genes are inserted into either the TK or the spheroidin locus (Hall et al. 
5 [\996] supra). 

A. Transfection of infected cells : LD-652 cells (1.4 x 10 6 cells, 70% confluent in a 
34.6 mm dish) are infected with AmEPV at an m.o.i. of 5 PFU per cell in a volume of 1 ml. 
Two hours post-infection, the inoculum is aspirated and 1 ml of transfection mix + DNA is 
added. 

10 Transfection mix + DNA is prepared by separately combining 20 |il Lipofectin 

(Gibco, Gaithersburg,MD) and 80 \x\ 1 :1 TE media without FBS, and 5 jag of shuttle vector 
plasmid DNA in a volume of 100 ^1 of the same media. The mixture is incubated at room 
temperature for 1 5 min. The concentrated transfection mix is then diluted by addition of 800 
|Lil of 1 : 1 TE without FBS and added to cells. After 6 hr, the transfection mix is removed and 

15 replaced with 2 ml of 1 : 1 TE with 10% FBS. Three days post-infection the supernatant is 

harvested by centrifugation at 200 x g. 

B. Selection of AmEPV recombinants : For selection of recombinants inserted into 
the TK gene, a TK(-) cell line, CI 1.3. was derived by serial passage of LD-652 cells in 
increasing concentrations of S-bromo^'-deoxyuridine (BudR) (10 |ig/ml increasing 

20 increments of BudR at intervals of five weeks over one year). CI 1 .3 cells are maintained at 

in 1:1 TE medium supplemented with 100 (ig/ml BudR (Li, Y., Hall, R. L., and Moyer, R. 
W. [1997] 'Transient, nonlethal expression of genes in vertebrate cells by recombinant 
entomopoxviruses" J. Virol. 71:9557-9562). Supernatants harvested from the 
infection/transfection described above are plaqued on CI 1.3 cells in the presence of 100 

25 (ig/ml BudR. Only virus lacking a functional TK gene, i.e. recombinant virus, will grow in 

the presence of BudR. For selection of recombinants where insertion is within the spheroidin 
gene (where lacZ has been incorporated into the shuttle plasmid) recombinant virus plaques 
are identified following staining with Xgal as described above or by the selection of non- 
refractile plaques. 
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C. Purification of recombinants : Dilutions at 10" 2 to 10" 4 of the infected cell 
supernatant are assayed by repeated plaquing (described above). Individual plaques are 
isolated and diluted in 1 ml 1:1 TE medium plus 10% FBS. Isolates are replaqued at 
dilutions at 10" 1 to 10" 3 . The plaque purification is repeated 3-4 times prior to plaque 
5 expansion for the propagation of larger stocks. These viruses can be stored at 4 oC for 2-3 

months or at -80°C for long-term storage. 

Example 9 — Foreign gene expression in permissive insect cells 

In our hands, we find the level of foreign gene expression within infected insect cells 

10 when driven by either of two late promoters (the cowpox virus ATI or spheroidin +20 

promoters) equals or exceeds that of vaccinia or baculoviruses on a per/cell basis. The 
technology needs no elaboration, as once suitable recombinant viruses are constructed, only 
appropriate infection of insect cells is necessary. 
Example 10 — Transient expression in vertebrate cells 

15 The normal host range of AmEPV is limited to Lepidoptera (butterflies), and early 

experiments attempting to infect vertebrate cells with AmEPV indicated no obvious 
deleterious effects on the cells. Given the general promiscuity of poxviruses in the binding 
and entering of cells and the similarity of the AmEPV life cycle to that of vaccinia, we had 
reason to believe AmEPV would infect and enter vertebrate as well as insect cells. AmEPV 

20 recombinants were constructed carrying the lacZ reporter gene regulated by either of two 

early AmEPV promoters, the late spheroidin promoter or the ATI promoter from cowpox 
(TK-fus/atZ TK-esp/t/cZ, SPH(20)/acZ and TK-ATI/aeZ, described above). When 
mammalian CV-1 cells were infected with the recombinant viruses at an m.o.i. of 10 
PFU/cell, those cells infected with AmEPV recombinants where lacZ was regulated by either 

25 the MmEP V fusolin or the AmEPV 42kD protein early promoters, expressed p-galactosidase 

(Figure 3, panels D and E) (Li, Y., Hall, R. L., and Moyer, R. W. [1997] "Transient, 
nonlethal expression of genes in vertebrate cells by recombinant entomopoxviruses'V. Virol 
71 :9557-9562). No expression was observed for the recombinants where lacZ was under 
control of either of the two late promoters (Figure 3. panels B and C). Thus, AmEPV enters 
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vertebrate cells and undergoes at least a partial uncoating which allows early, pre-replicative 
viral genes to be expressed. The infection is blocked thereafter and neither viral DNA 
replication nor late protein synthesis is observed. Early expression results from the fact that 
AmEPV, like other poxviruses, packages the enzymes necessary for early gene transcription 
5 within the virion. However, if vertebrate cells are co-infected with both VV and AmEPV, 

late promoters within AmEPV are rescued and activated suggesting that vaccinia can provide 
factors in trans which are needed for the infection to progress and activate the late promoters. 

While determining the basis of host range restriction is difficult, the cytoplasmic 
nature of AmEPV coupled with a virus encoded transcription and replication machinery 
10 offers major advantages for vector design. We can be fairly certain that late genes are not 

transcribed because of the lack of P-galactosidase expression from late promoters in 
vertebrate cells and because DNA synthesis, a requirement for late mRNA synthesis does not 
occur. It is quite possible that incomplete uncoating of the virus leads to the block in gene 
expression. 

15 Normally, uncoating of poxviruses occurs in two discrete steps. Upon entry into 

cells, virions are sufficiently permeabilized to allow early gene transcription from the viral 
core. Early proteins allow the complete uncoating of the core to allow transcription of the 
later classes of genes following interaction of newly synthesized DNA with intermediate and 
late transcription factors. The uncoating of vertebrate poxviruses has been thoroughly 

20 studied, and uncoating intermediates have been identified through differential centrifugation 

of cellular extracts infected with labeled virus. A viral activity specifically required for the 
second stage of uncoating has been identified. By analogy with vaccinia, AmEPV might be 
expected to encode a similar uncoating factor. If so, then one would not necessarily expect 
a cell-line specific block in uncoating unless this uncoating protein acts in conjunction with 

25 cellular components. Should we find that the particle is uncoated, then AmEPV fails to 

express other genes in vertebrate cells required for the infection to continue. Host range 
restriction of another insect virus family, the Baculoviridac\ has received considerable 
attention. For baculoviruses, blockage may occur at many stages during the activation of late 
or very late genes after viral DNA enters the nucleus (Carbonell, L.F., M.J. Klowden, and 
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L.K. Miller [1985] "Baculovirus-mediated expression of bacterial genes in dipteran and 
mammalian cells" ./ Virol. 56:153-160; Carbonell, L.F. and L.K. Miller [1987] 
"Baculovirus interaction with nontarget organisms: a virus-borne reporter gene is not 
expressed in two mammalian cell lines" Appl Environ. Microbiol. 53:1412-1417; Morris, 
5 T.D. and L.K. Miller [1992] "Promoter influence on baculovirus-mediated gene expression 

in permissiveand nonpermissive insect cell lines"*/ Virol. 66:7397-7405; Morris, T.D. and 
L.K. Miller [1993] "Characterizationof productive and non-productive AcMNPV infection 
in selected insect cell lines" Virology 197:339-348). The questions related to host range 
specificity are outside the scope of this review, but the limit of expression to those genes 

10 under the control of early promoters following infection of vertebrate cells is a key property 

of the virus which ultimately makes it a potential candidate as a vector for transient 
expression in vertebrate cells. 

The most novel feature of this system is the survival and continued growth of the 
infected vertebrate cells. We immediately noticed that there were no observable cytopathic 

15 effects in the infected CV-1 cells. Survival and growth of the infected cells was shown 

following infection with recombinant AmEPV TK-espg#>. Initially, individual fluorescent 
cells resulted, which over a period of two to three days, divided to form fluorescent 
microclusters of cells (Figure 4). There is no other known precedent of cells surviving a 
poxvirus infection. The fact that mammalian cells survive suggests that AmEPV offers the 

20 potential for a highly efficient, nontoxic method of foreign gene delivery into vertebrate cells 

for transient expression of foreign genes, after which the cells continue to grow unabated. 
While initial observations were made using CV-1 cells, these results have been extended to 
many other cell lines. In general, lymphocytic cells are more resistant to infection (Li, Y., 
Hall, R. L., and Moyer, R. W. [1997] "Transient, nonlethal expression of genes in vertebrate 

25 cells by recombinant entomopoxviruses" J. Virol 71:9557-9562). 



Example 1 1 — AmEPV mediated gene expression in the mouse 

To examine the feasibility of AmEPV to deliver and express foreign genes in vivo, 
we examined the effects of injection of TK-esp/c/cZ and SPH(20)/c/cZ into mouse muscle. 
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Approximately 2 x 106 PFU (100 |ul) of each virus was injected into the muscle of the hind 
leg of two separate mice; as an additional control, we also injected 100 \il PBS into a third 
mouse. Two days post infection, the mice were sacrificed and the muscle was excised into 
small pieces, fixed in a 3% formaldehyde solution for 30 min, and stained with X-gal. 
5 Extensive P-galactosidase expression occurred in the muscle from the TK-esp/acZ infected 

mouse (Figure 5). No expression was seen in either the SPH(20)/acZ or the PBS control. 
Thus, consistent with our observations of infected mammalian cells in culture, AmEPV can 
also enter cells in vivo and allow early, but not late expression of a reporter gene. 

10 Example 12 — The control of AmEPV induced inflammation 

One concern with complex viral vectors is the potential for the unintended induction 
of inflammatory and immunological responses following administration. In studies with 
adenovirus, inflammation and immunogenicity to the virus and to virus-infected cells has 
limited transgene expression and the utility of this approach to treat chronic illnesses. 

15 Inflammation is initially characterized by perivascularand peribronchiolar inflammatory cell 

infiltration. Neutrophils and later macrophages and lymphocytes frequent the site of the 
infected area. Specific cytokines can also be measured as an index of the inflammatory 
response (Ginsberg, H.S., L.L. Moldawer. P.B. Sehgal, M. Redington, P.L. Kilian, R.M. 
Chanock, and G.A. Prince [1991] "A mouse model for investigating the molecular 

20 pathogenesis of adenovirus pneumonia'' Proc. Natl. Acad. Sci. U.S.A. 88:1651-1655;Noah, 

T.L., I.A. Wortman, P.C. Hu, M.W. Leigh, and R.C. Boucher [1996] "Cytokine production 
by cultured human bronchial epithelial cells infected with a replication-deficient adenoviral 
gene transfer vector or wild-type adenovirus type 5" Am. J. Respir. Cell Mol Biol. 14:417- 
424). The early response to adenovirus infection consists of diffuse cellular infiltration of 

25 peribronchiolar and alveolar regions associated with the appearance of several classes of 

pro-inflammatory cytokines (Ginsberg et al [1991 ] supra: Noah et al [\996\ supra). These 
include TNF-a, IL-1, IL-6, and IL-8 (KC/GRO in the mouse). There is considerable 
experimental evidence from rodents demonstrating that these classes of cytokines, and in 
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particular TNF-a and IL-8 (or KC/GRO), play central roles in the recruitment and activation 
of inflammatory cell populations in the lung. 

It is likely with a virus as complex as AmEPV that unintended inflammation will 
result when the virus is introduced in vivo. However, vertebrate poxviruses may serve as a 
5 source of genes to provide a solution to this problem. There have been a variety of vertebrate 

poxvirus-encoded secreted virokines and viroceptors described including IFN-7P, IFN-(, 
TNF and IL- 1 , and chemokine receptors (Barry, M. and G. McFadden [ 1 997] "Virus encoded 
cytokines and cytokine receptors" Parasitology 1 15:S89-1 00; Smith, G.L., J. A. Symons, A. 
Khanna, A. Vanderplasschen, and Alcami, A. [1997] "Vaccinia virus immune evasion" 
10 Immunol Rev. 159:137-154:137-154; Turner, P.C. and R.W. Moyer [1998] "Control of 

apoptosis by poxviruses" Seminars in Virology 8:453-469). Should inflammation be 
observed, cloning any or all of these genes into AmEPV to control any AmEPV induced 
inflammation is quite plausible. 

15 Example 13 — The use of AmEPV to stably transform mammalian cells 

A. Considerations in the design of AmEPV vectors for stable transformation : 
Poxviruses are cytoplasmic and therefore poxvirus promoters are recognized only by the 
poxvirus encoded RNA polymerase and not by RNA polymerase II of the host cell. Since 
we wished to demonstrate the ability of AmEPV to mediate stable transformation of 

20 mammalian cells, we constructed a recombinant AmEPV containing a cassette in which gfp 

and a gene conferring neomycin resistance were cloned under the control of promoters 
recognized by the cellular (not poxvirus) RNA polymerase (Figure 6). The promoters 
chosen were the CMV immediate early and herpesvirus TK gene promoters respectively, and 
the cassette was flanked by the inverted terminal repeat (ITR) sequences of AAV. 

25 Although the exact mechanism of AAV site specific integration is unknown, two 

required viral components have been identified. These are the inverted terminal repeat 
sequences of AAV DNA (ITRs) and the Rep 78/68 proteins. The ITRs comprise two 145- 
nucleotide elements located at either end of the AAV genome. ITR sequences enclosing 
marker genes have been shown to allow a lower level of random genome integration when 
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compared to the levels of specific integration observed when genes encoding the Rep 78/68 
proteins are also included in constructs. 

B. Selection of stable recombinants : When vertebrate cells are transfected with 
plasmid pTK-UF5, and selected with G4 1 8 (250mg/ml) 24 hours later, colonies of resistant, 
5 fluorescent vertebrate cells are observed after about ten days of selection. In contrast, 

colonies from cells infected with AmEPVpTK-UF5 at an m.o.i. of five were G41 8 resistant, 
but not initially GFP positive. Approximately 20 G41 8-resistant clones can be isolated 
following infection of one million cells (one 35 mm dish). The efficiency of transformation 
is only 5-10 fold less than that for optimized plasmid mediated transfections. These 

10 AmEPV-derived colonies were observed to initially grow more slowly in the presence of G- 

4 1 8 than those produced from a plasmid transfection. Most likely, the explanation for these 
results is that the GFP and NeoR gene copy number in AmEPV-derived colonies is less than 
that for clones derived from cells transfected with the pTKUF5 plasmid. This explanation 
is further supported by the observation that the AmEPV derived colonies gradually became 

15 more resistant to G418 and that GFP expression while delayed, is ultimately observed. 

Fluorescent cells became both more numerous and brighter, consistent with a gradual 
increase in GFP concentration with time. Ultimately all cells in each clonal isolate were GFP 
positive, as shown in Figure 6. 

We are currently characterizing these cell lines, which have been grown reliably for 

20 multiple generations. Our unpublished data suggests random integration of the marker 

cassette into the cellular genome. Inclusion of the AAV rep gene in future constructs is 
anticipated to provide directed insertion into chromosome 19 (Samulski, R.J., X. Zhu, X. 
Xiao, J.D. Brook, D.E. Housman,N. Epstein and L.A. Hunter [1991] "Targeted integration 
of adeno-associated virus (AAV) into human chromosome 19" EMBO J. 10:3941-3950). 
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It should be understood that the examples and embodiments described herein are for 
illustrative purposes only and that various modifications or changes in light thereof will be 
suggested to persons skilled in the art and are to be included within the spirit and purview 
of this application and the scope of the appended claims. 
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